|
|
| Welcome to the Mac Forums forums. Please read the FAQ if you have questions. Register to participate. |
|
|||||||
| TouchArcade.com - iPhone Game Reviews and News |
![]() |
|
|
Thread Tools | Search this Thread | Display Modes |
|
|
#1 |
|
macrumors 65816
Join Date: Aug 2008
|
Intel's strategy... is it flawed? (eg. Gulftown)
Intel's strategy with Nehalem (and it's 32nm derivative Gulftown) appears to be to develop an amazing server microarchitecture and then push that technology down into other segments such as the mainstream (Lynnfield) and mobile markets (Clarksfield).
The other interesting element to their strategy is that with the die shrink accompanying Gulftown, they've opted to use some of the added die real estate by upping the core count from 4 to 6 and upping the L3 cache from 8MB to 12MB (which is a complete waste of silicon if you ask me). What's also notable is that early reports of Gulftown have them clocked at a very conservative 2.4GHz. While this might seem like a suitable approach to take, particularly if you are focused on making an ideal server chip, it doesn't seem to make much sense for mainstream computer user (or even workstation users), who will be inheriting this technology as the chip of the day in about 12-18 months. I personally think that this strategy on increasing core count and cache and neglecting clock speed is going to ultimately cause Intel grief. It's already to the point where very few can benefit by upgrading their computer. If you are an average PC user running a Core2 processor, there's very little benefit to upgrading to Penryn or even Nehalem. Gulftown's just going to climb further up the diminishing returns curve. If you are a pro user, such as those here, there's no compelling reason to upgrade from Penryn to Nehalem... and it looks like there will be even less reason to upgrade to Gulftown. Sure it's a 50% increase in core count, but how often is that actually useful and what are the tangible benefits? I think Intel is going down a difficult path. In the old days, going from a 400MHz CPU to a 800MHz CPU made a huge difference to your computing experience. Today, going from a 2 core system to an 8 core system is virtually unnoticeable for most. Even a hard-core user really has to think long and hard to justify an Octo core over a quad because the situations where it will provide any benefit at all are fairly rare. I see a period now where most users really have no compelling reason to upgrade their computer. This is not good for Intel. If it was up to me, I would offer a line of Intel processors with lower core counts and less cache with higher clocks. Surely if Intel can make a 6 core processor with 12MB of L3 at 2.4GHz, they could make a dual-core processor with 2MB of L3 at a clock of 5GHz and maybe a quad-core with 4MB of cache that runs close to 4GHz. Which would interest you more?
__________________
Mac Pro (2009 Quad), Mac Mini (2009), MacBook Air (Rev A), iPhone (3G)
|
|
|
|
| VirtualRain |
| View Public Profile |
| Find More Posts by VirtualRain |
|
|
#2 |
|
macrumors 603
Join Date: Jul 2005
|
you assume software will not take adv of these cores
thats the direction software is taking
__________________
Old Bike: 2008 Kawasaki Ninja 250r Green Current Bike: 2003 Suzuki SV650S Copper
|
|
|
|
| dukebound85 |
| View Public Profile |
| Find More Posts by dukebound85 |
|
|
#3 |
|
macrumors 68020
Join Date: Oct 2008
|
You have to think much further than incremental steps.
Intel is going through a strange transition, where the new immersion and scanners are ahead of chip design. Therefor their design teams can literately put the cutting edge chips into prodcution sooner than anticipated, they don't have to wait. Consumers will be offered a more powerful chip than before; pushing the gap of processor power further ahead of the applications taking advantage of the technology. Pushing forward will mean more die per wafer, lower cost per die, being 2 to 3 generations ahead of any competition, ramp the newer factories up to full production and get them paid off and really start focusing on 15inch or 450mm processes. One thing you forgot is, todays fabs making the state of the art processors will be chip set plants 2 to 3 years from now. Remember when we all thought a 1G harddrive was a lot of space. Same with processors, multicore will start being better utilized and before we know if 16 core will be common talk
__________________
Jailbreakers protect yourself against worms click here. HEED to Apple's warnings, you jailbreaking thieving criminals. Think differently, buy a PC |
|
|
|
|
|
#4 | |
|
Thread Starter
macrumors 65816
Join Date: Aug 2008
|
Quote:
The only unrealized potential gains to be had are those that leverage OpenCL and the hundreds of cores in the GPU. But OpenCL actually makes the core count in the CPU less relevant. A high-clock dual-core processor with a high-end GPU and OpenCL enabled software would be more valuable than a 6-core CPU.
__________________
Mac Pro (2009 Quad), Mac Mini (2009), MacBook Air (Rev A), iPhone (3G)
|
|
|
|
|
| VirtualRain |
| View Public Profile |
| Find More Posts by VirtualRain |
|
|
#5 |
|
macrumors member
Join Date: Jul 2009
|
Higher clock speed can be an issue. I think 3.5 GHz is the limit for clockspeed in a processor. Remember the Netburst Mircoarchitecture that made clockspeed the main selling point. But by 3.8 Ghz the chips were too hot and the chips chewed up loads energy. Amd's lower clockspeed chips performed better than the higher clocked Pentium 4 chips.
__________________
15" Unibody Macbook Pro mid 2009,2.66 GHz, 4 GB DDR3 Ram, Nvidia 9600M GT, 320 GB HD; ipod touch 2nd generation 16 GB
|
|
|
|
|
|
#6 | |
|
macrumors 603
Join Date: Jul 2005
|
Quote:
clockspeed has gone well past 3.5 ghz and 3.8ghz
__________________
Old Bike: 2008 Kawasaki Ninja 250r Green Current Bike: 2003 Suzuki SV650S Copper
|
|
|
|
|
| dukebound85 |
| View Public Profile |
| Find More Posts by dukebound85 |
|
|
#7 | |
|
macrumors 68030
Join Date: Sep 2006
Location: England
|
Quote:
Just because they can do something and the hardware would be better suited to things that doesn't mean it makes sense to do so from a business perspective. |
|
|
|
|
|
|
#8 |
|
macrumors 68020
Join Date: May 2009
Location: Munich, Germany
|
I think that Intel's strategy makes a lot of sense. It is classic tick tock and they execute it very nicely in my view.
2009 was a tock year with the introduction of the Nehalem micro architecture. I see Intel's innovation with the increase in bandwidth quite impressive. It is essentially Apple with their poor implementation of the RAM and PCIe design who messed up the product quality from Intel. You can buy competitive systems with 18 RAM sockets and four 16 lane PCIe slots. 2010 will be a tick year dedicated to the 32nm fab standard. So under normal conditions you would not expect to see radical changes. That Intel manages to increase the core count by 50% is a nice bonus in my view. I agree that the Gulftowns will most probably come in different clock speeds up to 3,3 GHz even if the initial clock speed is only 2,4 GHz. The new technology will then come in 2011 with the Sandy Bridge micro architecture.
__________________
MP4,1●3,33GHz Octad●160GB SSD RAID0●12GB RAM●ARC-1210●Blu-Ray●FireDTV-S2●SL/Win7-64 uMBP●160GB SSD●SL/Seven64 TC 2TB iPhone●16GB |
|
|
|
|
|
#9 | |
|
macrumors Demi-God
Join Date: Nov 2008
Location: Denver, CO
|
Quote:
I think everyone's getting a little ahead of themselves with these 6 core Gulftowns....
__________________
[Gandalf-Q8200|Server 2008 R2] [Durin-3.2 W5580 Quad|10.6.2] [Arien-iMac|10.6.2] [Arwen-G4 Mini|Server 10.4] [Aragorn-G4 Mini|Server 10.4] [Frodo-G4 Cube|10.2] |
|
|
|
|
|
|
#10 | |
|
macrumors 68030
Join Date: Jan 2008
|
Quote:
![]() Anyway, to answer your second question about what would interest me personally, it would be different technology all together first and foremost. After that though being stuck with the current tech base, I want to see both parallelism and clock speed progress significantly and incrementally. I want very much to have processors available with 64 physical cores operating at 6 or 8 gigahertz per core.
|
|
|
|
|
| Tesselator |
| View Public Profile |
| Find More Posts by Tesselator |
|
|
#11 | ||
|
macrumors 601
Join Date: May 2008
|
Quote:
The one thing that came out of Nehalem that will matter IMO, is the move to both QPI and the IMC when it is downlined to every family. We need to solve bottlenecks as much as increases in clock speed, as little software is dependent solely on the core (clock speed). Quote:
It's going to be awhile before we leave semiconductors it seems. I do recall a sub 10 nm process has been developed, and I'd be totally amazed if they scrap it, and move to another technology, such as optical. Those systems will have to be developed for the enterprise market first (really high end, and almost certainly first release as a supercomputer), before it ever trickles down to the more common systems, even in the enterprise lines.
|
||
|
|
|
|
|
#12 |
|
macrumors 6502a
|
I am definitely interested in more cores. Upping the clockspeed instead would just take us back to Intel’s days of the Pentium.
__________________
2.6GHz 15" MacBook Pro (Penryn) / 4GB RAM / 500GB 7200RPM HDD / 8600M GT 512MB 32GB iPhone 3GS |
|
|
|
|
|
#13 | |
|
macrumors 601
Join Date: May 2008
|
Quote:
When it happens, you pay through the nose though. For example, look at the SP and DP Xeons at the lowest to highest clocks in the Nehalem family. Ouch. $284 - $999 for the SP versions, and it's worse for the DP models.
|
|
|
|
|
|
|
#14 | |
|
macrumors 6502a
|
Quote:
__________________
2.6GHz 15" MacBook Pro (Penryn) / 4GB RAM / 500GB 7200RPM HDD / 8600M GT 512MB 32GB iPhone 3GS |
|
|
|
|
|
|
#15 |
|
macrumors 6502
Join Date: Apr 2004
|
I actually just sat in on a presentation by an Intel engineer on stuff like Nehalem and Gulftown the other day...
To answer your question... • It is going to be physically impossible to keep pushing Ghz. Intel tried to go for 4 ghz, and without extreme cooling methods, it just won't happen. Increasing in Ghz creates exponential heat increases, increasing cores creates linear heat increases, so cores are the best option moving forward. • Yes, software has trouble taking advantage of cores, and eventually we actually likely hit a wall of the number of cores software can take advantage of, which leads me on to the next point... • The biggest holdup in speed increases isn't actually the processor. Memory speeds have not increased as fast as processor speeds, and the speed penalties for going out to memory or GPU are awful and can stall CPU processing. • Intel already took the first step of dealing with this in Nehalem. On Nehalem the speed penalty for going out to memory is greatly reduced because the memory controller is now on CPU. Now instead of hoping over multiple buses to get to memory, memory is just a stones throw away from the processor. This is also why on the new Mac Pros, the memory has been moved physically closer to the processor. The length of the connections on the motherboard can make an impact, and they were reduced to lower time to memory. • The next problem is the GPU. In the future Intel wants to move the GPU onto the CPU for the same reason. The time for the processor to get over the PCI bus to your graphics card can be extreme in computer science times. The solution is to move your GPU onto the CPU so that they are right next to each other and the talk time is almost 0. • In the future, when you buy your 16 core CPU, it might come with one or two cores configured not as CPU's, but as GPU's. So you'd have a 14 core CPU/2 core GPU (with the GPU cores having many processing sub-cores as part of their silicon, much like a normal GPU.) • This does bring up the question of whether it's nicer to have a separate discretely upgradable GPU, but that's another topic.... • Intel is talking about having a full feature GPU core on your CPU, not like Intel Integrated graphics. Anyway, hope this helps a bit. |
|
|
|
|
|
#16 | |
|
macrumors 601
Join Date: May 2008
|
Quote:
So it's up to individuals to do so if they wish. Intel at least is offering that possiblity without intentional hinderances, and the board makers are willing to cooperate as well (allowing access to such settings in the consumer market). As there's not nearly the interest in such endevors in the enterprise market, the boards (particularly the DP boards) aren't OC friendly. No access to the settings, and so far, there's been no release of an enthusiast board for such systems (DP workstations for example, as the SkullTrail was for the 54xx parts). |
|
|
|
|
|
|
#17 | |
|
macrumors 6502
Join Date: Apr 2004
|
Quote:
I know overclockers can do it, but there is no way Intel can put out enough chips that can be supported in a 4 ghz configuration. And usually overclockers have much more advanced cooling systems. And again, at 4 ghz, you're going to be going way faster than your RAM can keep up with, which is why they're already having to increase cache sizes. Power bills are honestly the smallest issue with going 4 ghz. |
|
|
|
|
|
|
#18 |
|
Thread Starter
macrumors 65816
Join Date: Aug 2008
|
Nice insights goMac!
However, I still think Intel's strategy (whether by choice or forced on them by physical limitations) is providing users with little reason to upgrade. It's hard to believe, but for the average computer user, the move from 65nm to 45nm to 32nm will have netted very little increase in performance. The only ones benefiting from the last couple generations of Intel processors and likely the next couple generations are those with hard-core rendering/encoding workloads. I use to upgrade my PC with every new Intel processor as did a lot of other computer enthusiasts. Now, few are upgrading. Sadly, a 6-core Gulftown at 2.4GHz is not an upgrade to someone running a 65nm Q6600 quad! ![]() Anyway, Apple can create compelling reasons to upgrade their computer line without significant improvements in CPU's but I feel sorry for the other PC manufacturers who are faced with trying to market this stuff as much improved. ![]() EDIT: goMac... I think it's important to point out that Nehalem's move to an IMC was huge from an architectural perspective, but the real world gains are negligable... here's why... Back when the memory controller was on the NB and accessed across the slow FSB, Intel opted to workaround this high-latency situation by using massive L2 cache on the Core2 line of processors (8MB for quad cores). This meant that a cache miss was a rarity, masking the slow memory architecture. Now enter Nehalem with it's IMC, something AMD had been doing for a couple of generations. While this dramatically improves memory access, it's real advantage is that you can free up silicon that was previously spent on massive cache. However, for some odd reason I've yet to understand, Intel continues to spend large amounts of silicon real-estate on ridiculous L3 cache sizes. 8MB with current CPU's and 12MB with Gulftown! (Whereas I think AMD runs something like 1MB/core). That's insane for an IMC. It's proven that you can run your memory in single-channel, dual-channel, or tri-channel, and it won't make any difference... because the massive cache is masking any memory latency issues. Why Intel continues to use huge L3 cache sizes is a mystery to me. Apparently cache errors are also one of the key factors in limiting clock speeds... and why on Nehalem, the cache runs on a separate power and clock plane from the cores. Thus smaller cache sizes would enable higher clocks.
__________________
Mac Pro (2009 Quad), Mac Mini (2009), MacBook Air (Rev A), iPhone (3G)
Last edited by VirtualRain : Oct 17, 2009 at 08:16 PM. |
|
|
|
| VirtualRain |
| View Public Profile |
| Find More Posts by VirtualRain |
|
|
#19 | |
|
macrumors 6502
Join Date: Apr 2004
|
Quote:
The CPU is going to have to keep hitting main memory for more frames, and then ship them to the GPU.If you're playing back off disk, like most consumers are, it's kind of a mute point in since disk is so much slower than main memory. Main memory speeds are going to be the least of your problems. But, as consumers start getting more and more RAM in their computers, and RAM speeds still can't keep up, the disparity is going to get worse. If you have a consumer machine with 32 gigs of RAM in 5 years, a CPU with 8 megs of cache is going to be struggling badly to keep up with everything you've got in RAM. This of course assumes that we're going to have software that actually fills up 32 gigs of RAM, but considering I started with Macs back when the entire system fit in 2 megabytes, I wouldn't be surprised. Microsoft seems to be adept at finding ways to get Word for Mac to eat up more and more memory. ![]() Also, 16 cores fighting over 8 megs of cache doesn't sound fun either. But you're right, Harpertown is still very competitive with Nehalem. I still recommend Harpertown Mac Pros to people who are interested in buying a Mac Pro. I have a Harpertown at home and a Nehalem at work, and while the Nehalem is a bit better in dealing with virtual machines, my Harpertown is every bit as fast. But Nehalem is really Intel banking on the future, when there are a lot more cores fighting over a cache that's trying to handle a lot more memory. IIRC the new i7's have 12 megabytes of cache, but the quad i7's only have 8 megs of cache because there wasn't enough room on the die, which could be another problem. More cores might mean less room on the die for cache, which means we'll need faster memory access. |
|
|
|
|
|
|
#20 | ||||
|
macrumors 601
Join Date: May 2008
|
Quote:
![]() BTW, to me, even the cooling in an enterprise environement is tied to power useage (HVAC run harder, requiring more power, or even an upgrade to larger unit/s, using additional power). It all comes down to financial practicality. Single systems (larger cases, not crammed in a rack), have more options, and even those are left up to the individual. There's possiblities, such as direct liquid cooling to the processors, such as IBM's newest Power chips (constructed into supercomputers), but it's not cheap enough yet. It would take a massive economy of scale to get it into end user systems. Again, cost is the primary limitation, and Intel really doesn't have enough incentive yet. Maybe if they hit a wall, and nothing else is ready tech wise. Timing will be critical, but I doubt such exotic cooling systems will ever make it to the mainstream. Quote:
QPI and the IMC was a step in the right direction. If we can just get a drive tech fast enough at really low cost, the results will be amazing.Quote:
Quote:
![]() With current parts, cache works out to 2MB/core. Now take into consideration Hyper Threading. That's were they derived the value from. Same core count, but 2x the threads, so additional cache is needed. It doesn't do much ATM, and HT seems to be too buggy to be worth using as it exists now (from what I've seen and read). But down the road, it will. |
||||
|
|
|
|
|
#21 | |
|
macrumors 6502a
Join Date: Jan 2008
|
Quote:
Yes, I was one of those people who complained about CS4 not being 64-bit. "Nobody works with file sizes larger than 3 GB anyway." Nobody who doesn't have access to a printer that goes up to 54" wide, that is. At any rate, good thread. I have to imagine that SSDs are getting close to the point where the storage bottleneck stops applying and we can worry about other things instead. Right now, everybody who isn't using OCZ Vertex or Intel X-25M drives is at the storage bottleneck. I also kind of like the idea I heard before about a 2.5" or 1.8" SSD right on the logic board large enough to hold the OS for fast access. Maybe we'll see it in future chipset designs.
__________________
2.8 Penryn Octad MP+10GB RAM+Flashed 4870+Velociraptor+Logitech Z5500+Blu-ray+DS-263N+NEC20" NikonD80+Glass Video Editing+Photography+Design |
|
|
|
|
|
|
#22 | ||||||
|
macrumors 6502
Join Date: Apr 2004
|
Quote:
The silicon has limits where you are going to start hitting data corruption.The solution is to have multiple cores, each with their own memory bus. Just like with CPU's, it's easier to increase your RAM access speeds by adding buses in parallel instead of just pumping up the frequency of one memory bus. So still, even if you say "make memory faster!", it still leads you to needing multiple cores. There isn't any escape. ![]() And it's not like it's THAT bad. Even if software is written as single threaded on OS X (which is very rare these days), most of the underpinnings of the system are multithreaded anyway, which means you're probably still getting multicore performance out of a single core app. Hell, I wrote an app a few years back that I didn't do much in the way of explicit threading in, and with all the system libraries it had 16 threads. Quote:
Quote:
Quote:
HyperThreading has been around a long long time. Back in 2005 when I was playing with the Intel Mac Developer boxes they all had HT. I really doubt it's got bugs at this point. ![]() Also, HyperThreading doesn't necessarily need more cache. Ideally, you're pipelining instructions working with the same data, so you shouldn't need to go back to memory. Of course this is a concern really for the developer. I don't know if stuff like GCD takes cache into account, but eventually the developer probably won't have to worry about this sort of thing as much. Quote:
![]() Quote:
![]() (Feel free to let me know if my above numbers are wrong. I just went to Wikipedia.) Last edited by goMac : Oct 17, 2009 at 10:43 PM. |
||||||
|
|
|
|
|
#23 | |||||||
|
macrumors 601
Join Date: May 2008
|
Quote:
Quote:
The busses still have limits, and can't be exceeded, so the semiconductor drives are fixed to the bus max. If you hit the limit (where things end up shared), the chipset has to switch it. This could be addressed, but likely won't due to fixed transistor counts/cost per unit to keep yields high. Quote:
![]() Everything I've ever used that was multi-threaded, was developed that way. Quote:
Currently, it's throttled to ~660MB/s on the Nehalem systems. I had been hoping it was just the SP systems, but apparently it seems the DP units are affected as well. But both also only use a single chipset. Perhaps the second chipset would solve it, as QPI is doubled. So what I'm referring to, is just widening QPI to prevent such issues. Die size may have been the limiting factor, but Gulftown is slated to use the same chipsets without a redesign (makes financial sense for Intel).Quote:
Quote:
Quote:
DRAM is a possible solution now, but it's expensive and it requires a battery (not that trustworthy IMO).Keep in mind, that the disparity can also be mitigated by running multiple channels, and is what the faster SSD drive controllers are already doing. At least those from Intel, Indilinx, and Samsung are. There are solutions, but they've not gotten the costs down enough to allow for wide spread acceptance as of yet. The SSD market still needs time to mature. This is more of the point I was referring to. When the average computer ships with SSD as standard for example. Then the enthusiast/performance users can actually obtain drive throughputs that can blow our minds (compared to what we're used to now). |
|||||||
|
|
|
|
|
#24 | |
|
Thread Starter
macrumors 65816
Join Date: Aug 2008
|
While there's no arguing that there are a ton of bottlenecks elsewhere in the system to be addressed, the key issue at hand here, is whether there's much in the last couple of CPU offerings or the next couple coming down the pike that will compel people to upgrade their systems.
I maintain that central processing has hit a plateau and stalled and I'm not hearing any arguments to the contrary. We need more clocks. Or at least more aggressive turbo boost. A quote from Anandtech... Quote:
They want less cores and more clocks too.
__________________
Mac Pro (2009 Quad), Mac Mini (2009), MacBook Air (Rev A), iPhone (3G)
|
|
|
|
|
| VirtualRain |
| View Public Profile |
| Find More Posts by VirtualRain |
|
|
#25 | |
|
macrumors 601
Join Date: May 2008
|
Quote:
Also, that article is aimed at the mobile market, not desktop or enterprise. The dual vs. quad core for a desktop is arguable, as some do more multi tasking than others, and possibly a few run multi-threaded apps often enough it makes sense for them, without a battery to worry about (run times). I do think most can hum happily along on a dual core, but there are some exceptions. Laptops are another issue however, as there is always battery life to contend with, and compromises must inevitably be made.
|
|
|
|
|
![]() |
| Thread Tools | Search this Thread |
| Display Modes | |
|
|