quagmire said:
First to note, that Virginia tech is not on the list because they do not have a supercomputer running so they had to drop out.
With the rumored 2.3 Ghz G5 Xserve update couldn't we see alot more speed out of them. Apple doesn't tell us the speed and I think everyone is presuming 15 Teraflops is with the 2.0 Ghz dual G5 Xserve. How do you calculate the speed of a super computer made up by 1500+ running dual 2.3 Ghz G5. With the itanuim supercomputer running at 20 tf, and we reach 25 tf hello 2nd.
The theoretical performance of a PPC970 is
2 Floating point units X 2 flops per clock cycle x the GHZ of the CPU
Therefore 1566 x 2 cpus ea X 8 GigaFlops = 25.06 TF Theoretical
SO I would assume they are using the 2.0s
Now they claim they will get about 15 TF actual
15/25= .60
So they are assuming that they can reach about 60% efficientcy on Linpack
That is fairly high percentage considering the use of Gigabit Ethernet, but it could be attainable with some good performance tuning.....
1st it would depend on how much memory the nodes have.... the more the better.... because a larger LINPACK problem can be run which will cause the system to spend more time calculating than sending messages...
2nd. The compiler version makes a huge difference......If it was my cluster I would use IBM's xlf compiler.
3rd. The BLAS libraries you use are very important.... I saw the other day that GOTO has just finished a set of customed tuned BLAS libraries for the PPC.... If it was my system I would use them cause GOTO writes the best.
4th. Even though they selected Gig E ..... all Gig E isnt created equal.....
Some are much better than others.... and suprisingly some have fairly low latency......Example Extreme's Blackdiamond is even as low as 10 microseconds....
Now they didnt say whose Gig E they used.... But if they went with Extreme that could help.
5th The MPI you use maters a good bit too..... 3 real choices here....
MPICH.... free
LAM..... free
MPI PRO.... expensive
While mpipro is the best overall
LAM is very quick on TCP/IP
Mpich is more like the middle of the road and kind of the standard in research systems.
6th and Final.... Linpack has a lot of tuning parameters.... If these guys are really good they can squeak some extra FLOPS out of tunning linpack.
So who knows.... maybe they will get 15 TFlops.... heck maybe they will get more.. maybe less....
but at least its another MACINTOSH and a win for APPLE!!!!