PDA

View Full Version : Intel document confirms Prescott dissipates 103 W




szark
Aug 18, 2003, 09:23 PM
From the Inquirer (http://www.theinquirer.net/?article=11092):

103 W can be cooled down with regular air heatsink but you have to be aware that this cooler would be a very massive and noisy number indeed.

And that's at 90-nm! :eek:



Powerbook G5
Aug 18, 2003, 09:44 PM
I'm not surprised...our new 2.8 GHz Dell overheats and smells like something is burning after 10 minutes if someone accidentally bumps the door to the computer desk closed...so a higher clocked Pentium...geez. My PII used to overheat and melt parts in my computer all the time on my old Gateway, too, I went through 5 modems in that damn thing before I switched to Apple.

MrMacMan
Aug 18, 2003, 09:50 PM
Hot Hot hot!

Its getting hot hot hot!

--Couldn't help myself.

Damn... I think they are gonna start to equip these things with Water Cooling!


No kidding!

Sun Baked
Aug 18, 2003, 10:00 PM
Cool, now they definitely could team up with Hasbro to upgrade the Easy Bake Oven to Pentium Power.

Crunch numbers and bake a cake at the same time.

pivo6
Aug 18, 2003, 10:02 PM
Originally posted by Sun Baked
Cool, now they definitely could team up with Hasbro to upgrade the Easy Bake Oven to Pentium Power.

Crunch numbers and bake a cake at the same time.

Oven? More like a space heater. Nothing like a dual purpose appliance in your room. :)

patrick0brien
Aug 18, 2003, 10:07 PM
-Holy Mother of effing Pizza!

Sorry, just felt good saying that.

Well, I guess this is what happens when you over build a CISC processor.

Yes, yes, I know that even the P4 has RISC execution capability - but it's a bolt-on. It's still a 1991 586 at heart.

ddtlm
Aug 18, 2003, 10:17 PM
patrick0brien:

The only thing "bolted on" is the x86 front end.

szark

Actually 90nm hasn't done as well as Intel was thinking, and based on that I'm thinking that IBM may find themselves in the same situation (not 103W, but more dissipation than they expected). Everyone should avoid laughing at Intel until we know how well IBM does...

patrick0brien
Aug 18, 2003, 11:27 PM
Originally posted by ddtlm
patrick0brien:The only thing "bolted on" is the x86 front end.

-ddtlm

Not true. It's the foundation.

Otherwise MS would have to start code-new.

BTW- This on isn't laughing at Intel - they are smart guys. I'm shaking my head because they are held from truly innovating because they need to maintain old MS software.

Yes, it sounds like bias, but it's just fact.

MS and Intel's greated strength, is also their greates weakness. They have dominance, but have also built in the inertia for change.

Powerbook G5
Aug 18, 2003, 11:45 PM
You do have to admit, it must take Intel some serious ability to exploit x86 technology to the point where it is now.

patrick0brien
Aug 19, 2003, 01:12 AM
Originally posted by Powerbook G5
You do have to admit, it must take Intel some serious ability to exploit x86 technology to the point where it is now.

-Powerbook G5

I absolutely agree, and my cup is up to Intel for continually increasing the performance of the x86 architecture, and to Microsoft for keeping Windows functional.

Both, being based on 70's tech - this used to be considered impossible. Well done gents.

BTW- This is honest, no sarcasm.

Powerbook G5
Aug 19, 2003, 01:19 AM
It makes you wonder...if they weren't limited by Windows legacy x86 code and Intel could fully go RISC or do whatever they wanted with their processor technology, how much could they wipe the floor with everyone else?

Catfish_Man
Aug 19, 2003, 01:29 AM
Originally posted by Powerbook G5
It makes you wonder...if they weren't limited by Windows legacy x86 code and Intel could fully go RISC or do whatever they wanted with their processor technology, how much could they wipe the floor with everyone else?

It's called Itanium. It's VLIW, kinda next-gen RISC.

Powerbook G5
Aug 19, 2003, 01:43 AM
Hasn't that processor proven to be a huge let down for Intel, though? I suppose it's more the fault of the PC industry not able to move beyond the normal Windows way of doing things, but still, it seems like they aren't having much luck.

ddtlm
Aug 19, 2003, 02:02 AM
Powerbook G5:

Hasn't that processor proven to be a huge let down for Intel, though?
Yeah, but it is probably still the fastest processor out there. (Seemingly due as much to impressive fabrication as anything else.)

patrick0brien:

The x86 ISA is not the foundation of the P4 nor any other modern x86 processor, it is only the language that the front-end translates to some back-end RISC ops. There is nothing 70's about the P4 except for its front-end (which is what makes MS happy).

crenz
Aug 19, 2003, 04:57 AM
Originally posted by patrick0brien
Not true. It's the foundation.

Otherwise MS would have to start code-new.

Not quite. The x86 instruction set is just a "frontend" and has little to do with the implementation of the chip. They can radically replace the design and still keep the same frontend. For a radical example of this, see the Transmeta processor designs. Their design has nothing to do with the original Pentium, yet MS didn't have to start "code-new".

It's a bit like saying that just because both Mac OS X, HP UX, Solaris and Linux lets you type "ls" and "cd" in the terminal, they are all the same.

However, I agree with what you said about inertia. Definitely the main reason for the Itanium's failure.

OutThere
Aug 19, 2003, 12:09 PM
Just start building the computers into their own micro-refridgerator cases, no worries about overheating!

whooleytoo
Aug 19, 2003, 12:26 PM
Originally posted by Catfish_Man
It's called Itanium. It's VLIW, kinda next-gen RISC.

Exactly, result: nice technology, terrible market share (a la Apple?)

From The Register:

"Dell managed to eke out 54 system sales in the fourth quarter of 2002 but saw shipments fall to 14 units in the most recent period. That's right. 14.

IBM boasted 34 sales in Q4 but saw Itanic servers disappear all together in Q1. What do we mean by disappear? Zero servers shipped. "

Obviously, slow and steady evolution, rather than revolution, is the way to increased market share.

Mike.

Backtothemac
Aug 19, 2003, 12:31 PM
My 2.6 gets really hot. But that system is a quiet as an iMac.

ddtlm
Aug 19, 2003, 12:43 PM
whooleytoo:

Heh, I thought the jury was still out on the Itanium being nice technology. :) I won't say anything positive about it other than "its pretty fast".

Powerbook G5
Aug 19, 2003, 12:50 PM
I am not sure how much power/heat the 2.8 GHz P4 puts out, but it is a whole lot. Our Dell system is so hot that after a month of use, it's already warped all the wood around the desk it sits in and the fan sounds like it's ready to spin right out of the case if you play games or do anything to make it work on something.

bousozoku
Aug 19, 2003, 01:08 PM
Originally posted by Catfish_Man
It's called Itanium. It's VLIW, kinda next-gen RISC.

next-gen? It's a direct descendent of HP Precision Architecture, which goes back to around 1990. The only thing PA RISC did for Hewlett-Packard was to unify their hardware lines.

Itanium proved a failed implementation, but Itanium 2 looks much better. By the next generation, it may actually be used in more than 200 machines.

macrumors12345
Aug 19, 2003, 03:31 PM
Originally posted by ddtlm
Powerbook G5:


Yeah, but it is probably still the fastest processor out there. (Seemingly due as much to impressive fabrication as anything else.)

patrick0brien:


It doesn't compare so well to Power4+. Core for core it does fine, but the key is that Power4+ packs dual cores on a similar sized die to Itanic's single core die, so in the end Itanic gets left behind.

Still, Itanic certainly leaves UltraSPARC in the dust, as well as the abandoned Alpha, PA-RISC, and MIPS lines, FWIW.

ddtlm
Aug 19, 2003, 04:03 PM
macrumors12345:

I don't think the 1.7ghz Power4+ compares well performance-wise core-vs-core to the 1.6ghz Itanium2, and I don't think dual-core or die space considerations matter. It may cost Intel a lot more to fab two Itanium2's than it costs IBM to fab a Power4+, but I don't think Intel cares at this point (still in the massive bleeding of cash stage).

macrumors12345
Aug 19, 2003, 08:02 PM
Originally posted by ddtlm
macrumors12345:

I don't think the 1.7ghz Power4+ compares well performance-wise core-vs-core to the 1.6ghz Itanium2

There is no such thing as a 1.6 Ghz Itanium 2, so I am sure that a Power4+ compares very well to it since it scores 0 on all benchmarks!

On the single-threaded SPEC benchmarks, the 1.5 Ghz Itanium 2 scores either 1075 or 1300 on SPECint2000 (depending on whether you want to use SGI's, Dell's, or HP's numbers) and between 1875 and 2100 on SPECfp2000. The 1.7 Ghz Power4+ scores 1100 in SPECint2000 and 1700 in SPECfp2000. So a single Power4+ core is about 5% slower in integer and 15% slower in floating point than an Itanium core, according to SPEC, although you should note that the Itanic scores appear to reflect the publicized "179.art cheat" that Sun found, whereas the Power4+ scores appear to be playing it straight. Nevertheless, the Itanic is slightly faster in single threaded code, but the key word there is slightly. I'm sure that the two cores will trade off leads in single threaded benchmarks with each new revision (note that Itanic was revised more recently than Power), as they have been for the past couple years. I would personally consider both cores to be pretty competitive with each other on a strictly core-to-core comparison, and I will say the same thing when Power is slightly ahead of Itanic, but perhaps you consider 10% to be a much larger gap than I do.

The important question, however, is whether these chips are used to run just one thread. The answer is no. These are first and foremost server chips, and nobody is going to run a server with just one thread active. The chips also see some use in workstations. There it is possible that you could care only about single threaded performance, but still unlikely, as evidenced by the fact that over 80% of HP's Madison workstation configurations come with dual processors (see http://www.hp.com/workstations/itanium/zx6000/reseller.html). So, in summary, SPEC CPU2000 is clearly *not* the benchmark you want to use to evaluate these chips.

and I don't think dual-core or die space considerations matter.

Wow, then you had better tell IBM, Sun, Intel, and HP this, because their long term plans are all heavily focused on producing multicore chips!

It may cost Intel a lot more to fab two Itanium2's than it costs IBM to fab a Power4+, but I don't think Intel cares at this point (still in the massive bleeding of cash stage).

If Intel doesn't care about costs, then why don't they just double the L3 cache from 6 MB to 12 MB? I'm sure it's technologically possible, and it would undoubtedly crank those SPECfp scores up a bit higher. Sure, it might cost more, but costs are irrelevant, right?

Look, in the end it is all about trade-offs. If IBM wanted to, they could replace the second core on Power4+ with more cache, and then the single threaded SPECfp2000 scores would shoot up even higher (that benchmark, as you probably know, is highly dependent on bandwidth). But they (and by implication their customers) feel that those transistors are better used on a second core, because they don't want to just run one thread at a time. Implicitly Intel/HP are also thinking this, since they are trying pretty hard to get a dual-core Itanic chip out the door sooner rather than later.

There is also a subtle point that many people miss when comparing single threaded SPEC scores between Itanic, which is a VLIW processor, and a more "conventional" RISC processor like Power4. Specifically, Itanium tries to wring everything it can out of instruction level parallelism, whereas Power4 focuses on getting performance from thread level parallelism. But since SPEC only runs one thread, it is essentially the case that Itanium is allowed to exploit its parallelism in this benchmark whereas the Power4 is restricted from exploiting its parallelism (since the benchmark only runs as a single thread). A better way to look at it is from the standpoint of the problem. To the extent that a task is inherently non-parallizeable, neither Power4 nor Itanium is probably going to do too well at it. But to the extent that the task is parallizeable, then Itanium can use its ILP advantage in SPEC, but Power4 is not allowed to use its thread level advantage in SPEC. In reality, of course, to the extent that the problem is parallelizable, any decent programmer would be trying to utilize both ILP and thread level parallelism.

macrumors12345
Aug 19, 2003, 08:08 PM
ddtlm:

Completely unrelated question: did you say at one point that you are running OS X off of a RAID Level 0 two disk array?

I was considering getting a second 160 GB drive for my G5 (when I eventually get it) to stripe, but I had read that you can't boot OS X off of a striped array, which would obviously throw a wrench in my plans.

Do you actually get any noticeable performance improvements out it? Honestly, it is not like I am editing huge media files on a regular basis...I was just thinking of getting some extra storage, and if I have it then I was thinking I might as well stripe it. But perhaps it is more trouble than it's worth.

Thanks.

ddtlm
Aug 19, 2003, 09:47 PM
macrumors12345:

There is no such thing as a 1.6 Ghz Itanium 2, so I am sure that a Power4+ compares very well to it since it scores 0 on all benchmarks!
Hmmm, I thought that both models existed.

...although you should note that the Itanic scores appear to reflect the publicized "179.art cheat" that Sun found, whereas the Power4+ scores appear to be playing it straight.
Not nearly such a big effect as when Sun did it though.

These are first and foremost server chips, and nobody is going to run a server with just one thread active.
I don't think its fair to try to count processor dies when comparing Itaniums to Power4's. Thats not how they are marketed, thats not how they appear to the OS, and thats not what you pay for. If you have two Itanium2's for every Power4 die then there is no thread advantage in IBM's hands.

Wow, then you had better tell IBM, Sun, Intel, and HP this, because their long term plans are all heavily focused on producing multicore chips!
IBM's multicore chip requires a lot of external baggage for for the L3. What good is going multicore if you can't fit all the cache on the chip? What are they gaining? Intel has more processor dies in a system to get the same number of cores, but they don't have external L3 everywhere. I don't see why you are especially excited about IBM's way of doing it.

I think going multicore is a good idea, but not important at this time. Intel figures that at 90nm they can put two cores and 18 MB (or something) of L3 on a die. Thats a multicore chip I'd get excited about, well if they adopted on-die memory controllers like the Opteron, anyway. (I wonder how much heat that'll throw off!)

If Intel doesn't care about costs, then why don't they just double the L3 cache from 6 MB to 12 MB?
Well they had to draw the line somewhere and 6MB on-die is a new record.

Look, in the end it is all about trade-offs. If IBM wanted to, they could replace the second core on Power4+ with more cache, and then the single threaded SPECfp2000 scores would shoot up even higher (that benchmark, as you probably know, is highly dependent on bandwidth).
You don't think that their 32 to 128 MB L3 is enough cache? Certainly adding yet more cache (presumably L2) on die wouldn't be a big deal. You should consider the reverse of your suggestion: what if Intel had put two Itanium2 cores together with a small L2 on die, and put 128 MB L3 off-die? The Itanium2 die is primarily cache; the core is apparently smaller than that of a Pentium4 at least, I forget just how big it is.

Specifically, Itanium tries to wring everything it can out of instruction level parallelism, whereas Power4 focuses on getting performance from thread level parallelism.
Not true. The Power4 is also a very aggressive instruction-level machine, as evidenced by the internal instruction packages which some have compared to Itanium's instruction bundles. Putting two cores on a die is nice and becoming more cost effective, but doesn't accomplish anything different that putting twice as many processors into the system. Trying to compare a dual-core Power4 to a single Itanium isn't fair.

Completely unrelated question: did you say at one point that you are running OS X off of a RAID Level 0 two disk array?
Do you actually get any noticeable performance improvements out it? Honestly, it is not like I am editing huge media files on a regular basis...I was just thinking of getting some extra storage, and if I have it then I was thinking I might as well stripe it. But perhaps it is more trouble than it's worth.
Yeah it seems pretty darn fast, but I don't do it for speed so much as just to make them into one disk. I had two available 36gig SCSI's from my Linux box after one of the drives had a problem which I didn't appreciate, but I didn't want to screw with distributing my stuff on two disks, so I just striped them and back up my data. :) I am booting off the striped RAID, BTW. They are the only disks in the system.

macrumors12345
Aug 20, 2003, 03:59 PM
Originally posted by ddtlm
Hmmm, I thought that both models existed.

I believe that the 130 nm Itanic comes in 1.3, 1.4, and 1.5 Ghz flavors (and varying amounts of cache). 1.5 Ghz is definitely the fastest though.


Not nearly such a big effect as when Sun did it though.

Not quite, but still pretty big. And since it is a geometric mean, having a really big outlier vs. a big outlier is not as big a difference as you would think. I think Sun probably managed to boost their overall FP score by about 15%, and HP by about 8%. Such is benchmarking...


I don't think its fair to try to count processor dies when comparing Itaniums to Power4's. Thats not how they are marketed, thats not how they appear to the OS, and thats not what you pay for.


Okay, but I thought we were discussing it from an engineering standpoint. If you want to discuss the chips from a commercial/marketing standpoint, then the discussion is already over, because I am sure that we both agree that to date Power4 has been a commercial success while Itanic has been a commercial failure. There is not even any comparison between the two of them.

If you have two Itanium2's for every Power4 die then there is no thread advantage in IBM's hands.

You mean besides the fact that IBM is making money on Power4 and Intel/HP are hemorraging it on Itanic? Once again, from a commercial standpoint there is no contest - the relevant debate is from an engineering standpoint (in which it takes two Itanic 2's to beat a single Power4+).

What good is going multicore if you can't fit all the cache on the chip? What are they gaining?

Performance, apparently. IBM certainly could have chosen to go with a larger on-chip cache and a single core, but they ascertained that going dual core would provide better performance. And apparently, their customers (and competitors) seem to agree.

Intel has more processor dies in a system to get the same number of cores, but they don't have external L3 everywhere. I don't see why you are especially excited about IBM's way of doing it.

I wouldn't say that I am "especially excited" about it - I don't particularly care that much one way or the other, except in so far as new Power chips mean new PPC 9xx chips. ;-) All I am saying is that it certainly seems to work pretty well for them.

Intel figures that at 90nm they can put two cores and 18 MB (or something) of L3 on a die. Thats a multicore chip I'd get excited about

You're a tough customer to please, huh? Well, that is planned for 2005 - I'm sure that IBM will have plenty of goodies packed into the 90 nm Power5+ as well.

You don't think that their 32 to 128 MB L3 is enough cache?

It's not at full speed. But for the record, those figures are per MCM, i.e. 8 to 32 MB per chip (4 to 16 MB per core).

You should consider the reverse of your suggestion: what if Intel had put two Itanium2 cores together with a small L2 on die, and put 128 MB L3 off-die?

My understanding is that they want to (put two cores on a chip), which is why they are rushing the dual core chip out the door in 2005 rather than simply waiting for their originally planned multicore chip ("Tanglewood?") to ship in 2006/7.

The Itanium2 die is primarily cache; the core is apparently smaller than that of a Pentium4 at least, I forget just how big it is.

Well, that is interesting. Hmmm...so why does it dissipate so much power (even in the low end 1.3 Ghz, 1.5 MB cache versions)?

Yeah it seems pretty darn fast, but I don't do it for speed so much as just to make them into one disk.....I am booting off the striped RAID, BTW. They are the only disks in the system.

Hmmm....I'll keep that in mind when I get the G5. It will probably depend on how lazy I am feeling (do I want to reformat the disks? am I going to be that religious in backing up?). Thanks!

ddtlm
Aug 20, 2003, 05:44 PM
macrumors12345:

If you want to discuss the chips from a commercial/marketing standpoint, then the discussion is already over, because I am sure that we both agree that to date Power4 has been a commercial success while Itanic has been a commercial failure.
No arguement there! :) Heh heh. I'm sure they're billions negative at this point, and increasing all the time.

Performance, apparently. IBM certainly could have chosen to go with a larger on-chip cache and a single core, but they ascertained that going dual core would provide better performance.
So you assume Intel went the other way (single core but bring L3 on die) because it offered lower performance? I don't see how sharing a die... sharing an L2... is going to be very useful for performance (I can see it helping some, but I also see huge on-die caches being helpful).

But for the record, those figures are per MCM, i.e. 8 to 32 MB per chip (4 to 16 MB per core).
True, when all chips are in use their share is reduced, but for single-thread SPEC one core does get to use all the L3 in the MCM.

Well, that is interesting. Hmmm...so why does it dissipate so much power (even in the low end 1.3 Ghz, 1.5 MB cache versions)?
Not sure. But one of the advantages of the IA-64 ISA is that it is supposed to move a lot of the complexity out of the processor and into the compiler. I've seen more precise ideas of how big a core it is thrown around on Aceshardware forums.