1. The 2009, 2010, 2012 Mac Pros have one big fan built into the PSU, one big fan dedicated to cooling the PCIe cards, one big fan to suck air over the CPU/memory board, one big fan to suck air out of the back of the bottom of the case and a small fan to cool each CPU; so if you have a dual CPU that's six fans or if you have a single CPU that's five fans. Are we to believe that Apple has been wasting money in the past by using five or six fans? Definitely, not. While it is true that innovation requires thinking differently, but it does not require one completely going foolishly insane. Thinking differently in many aspects may require thinking like one did in the past in another, particular aspect.
2. Putting two CPUs in that tube would just exaggerate the problem. Sometimes one can innovate to the point that they have to think differently in every way and thinking differently may mean going back to thinking like one did once in the past, but not so far back as the Cube completely. One reason why, in the past, Apple didn't didn't give us Mac Pro systems with the absolute fastest CPUs was TDP related. They wanted to give us fast systems that were quiet and cool so that throttling didn't occur often. My suggestion to Apple would be not to abandon the notion that less is often more; so they need to use a Ivy Xeon with a TDP of 115 or less so that they can run one or more of the following CPUs full out:
Xeon E5-2640 V2 : 2.0 GHz, 20 MB L3, TDP 95 Watts (8-Core) (8 x 2 = 16 GHz)
Xeon E5-2660 V2 : 2.2 GHz, 25 MB L3, TDP 95 Watts (10-Core) (10 x 2.2 = 22 GHz)
Xeon E5-2695 V2 : 2.4 GHz, 30 MB L3, TDP 115 Watts (12-Core) (12 x 2.4 = 28.8 GHz)
because of TDP (only one system fan) and steady step up in speed.
3. In building my 2010 Hackintosh (see GB2 score of 40,100 in my Signature, below,) the issue I faced was thermal dynamics/throttling, i.e., how to keep those two Xeon 5680s cool, but run those CPUs fast. At first, I tried overclocking them greatly (but that just made the problem worse) and then I tried overclocking them less and less until I arrived at the idea of underclocking them to run them at speeds of under 2.5 GHz at idle, but magnifying their TurboBoost potential so that they had turbo bins of 13, 13, 13, 13, 14 and 14 for each 6-core CPU to give me the performance I need for rendering. Then I started to achieve Cinebench 11.5 scores in excess of 24.5 and since I use, among others, Cinema 4d - lighting fast renders. But this one fan nonsense has significant limits, especial when there's no external brick for the PSU because it's also in the cylinder. Like Officer Harry Callahan would say, "One's got to know his system design's limitations" and I'd add: "and work around them."
4. Titans cards would have to be modified to be useable; I like this (my Tyan-based WolfPackAlphaCanisLupus0 and am thinking about building another one):