Originally posted by Cubeboy
Macrumors12345:
Looking back, I see your point, so I'm going to do another comparison which compares the CPUs in terms of how they perform in each filter.
Wow, thanks for the analysis! That was a welcome sight!
I'd still really like to see a comparison of those results with the results of the G5 w/o plugin to see which of those are optimized.
Anyone care to speculate what causes each unit to perform so much better on certain tests?
Rotate 90 is largely a memory op so maybe the caching on the P4 helped it there? It can't fit the whole image in cache, but if the filter is designed well it could exploit it pretty well.
Rotate .9 is largely a floating point op, so the G5 caught up? We know the P4 FPU is weak.
I'd expect these to have been optimized.
What's up with the divergence in the gausian blur numbers? Does the larger radius blow the G5 L2 cache?
RGB-CMYK sounds floating point with a small memory footprint (almost a register op)-- advantage G5?
Lens flair also sounds like a memory user-- advantage P4?
I know so very little about how most of the operations work, that all I can hope to do is spark someone elses interest and get real information.
I am curious though whether the lack of an L3 cache hurts the G5, even with the fat pipe to main memory.
The duals might help a bit here since they can snoop each other's L2 caches.
For the case of upoptimized ops, it sounds like there are common G4 Altivec ops that cause the G5 to stick (unfortunate artifact of switching chip vendors and a potential problem for compatibility)-- so that might explain why some operations seem to give such a huge advantage to the P4.
I think any op that spun up the disk is invalid, but we don't know which those were...
Is there anyone out there who has designed filters like these?