So clearly the bandwidth is a limiter/constraint...
So estimated time for shader to complete based on bandwidth constraint
would be:
628.320.000.000 bytes / 5.368.709.1200 bytes / sec =
628320000000 bytes / 53687091200 bytes / sec = 11.7 seconds.
So instructions per second exected would be:
104.720.000.000 / 11.7 = 8.950.427.350 instructions per second.
For two warriors that would mean 4.475.213.675 cycles per second.
Let's see.. a dual core cpu achieves 16.000.000 cycles per second.
The gpu performance would be staggering/very good.. however I have a
feeling there must be another bottleneck/constraint somewhere....
There could also be an execution constraint for the gpu.
Stats/specs say something like: Fill rate: 15.7 billion pixels/sec.
I think that's about:
15.7 * 1000 * 1000 * 1000 = 15.700.000.000
So far this seems within range of the number of above.
Conclusion: performance could be staggering/super speed !
Speed up over cpu would be:
4.475.213.675 / 16.000.000 =
4475213675 / 16000000 = 279.7
The gpu would be about 280 times faster than a cpu !
That's the kind of performance gain I am looking for ! ;)
Me very happy about that number ! =D
As long as the code will compiled this should definetly be achieveable !
However there is still a little catch... these numbers do not include the
initialization... this would
need to be done for each battle... but that's probably pretty quickly done
as well...
Even a 200 speed up would be real nice ! ;)
So these numbers are very encouraging and I will definetly continue my
development efforts to get a parallel gpu corewars executor going !




:)
Bye,
Skybuck =D