Surely it is both? Apple's philosophy is to push forward, on all fronts. [I'm hoping they will let Anand Shimpi talk about Fusion Architecture, similar to his last appearance, in between the M2 Pro/Max and the M2 Ultra.] It appears there is a sort of parallel or bidirectional process as the architecture moves between the A-series and the M-series -- by the time the platform makes it from A18 Pro in the iPhone 16 Pro to M5 Pro/Max in the MacBook Pro, it is tried-and-true. The A19 Pro, at the same time, benefits from that experience and builds on it.So the performance increase between M4 and M5. Is it due to architectural upgrades (the new cores for example) or is it the chiplets?
It's still a process-node refinement, from N3E to N3P, so that contributes -- TSMC has advertised a 5% performance increase with a 5%-to-10% power-efficiency improvement. [Tom's Hardware] So it helps, but it's not the heart of the matter.Because they are still using 3nm so that shouldn’t account for any of the uplift?
I just mean the new framework ("Fusion Architecture") may allow for changes that weren't possible when it all had to fit onto a single die. The new core structure probably isn't just a change in names, more die space means more design flexibility, with different limits.@DrWojtek @tenthousandthings I don't see how chiplets can have any part in improving performance. What am I missing?
The M5 Pro and M5 Max have separate dies for both the CPU and the GPU, right? Do you think they will be encapsulated together?
Do you think they will engineer two heatipes or dissipators, one for the CPU and a bigger one for the GPU? Maybe that’s not possible for the MacBook Pro, but for the Mac mini and Mac Studio…?
I mean, with those neural accelerators, the 20/40 cores GPU will get very hot in machine learning tasks… and if they share the heatpipe, the heat could be transferred to the CPU, which is usually not as hot, right?
I think WWDC is our best chance of seeing one this year, but timing is tight if they stick to what they've done in the past.
MacBook Pro M1 Max -> Mac Studio M1 Max/Ultra = 4 months, 23 days
MacBook Pro M2 Max -> Mac Studio M2 Max/Ultra = 4 months, 20 days
MacBook Pro M4 Max -> Mac Studio M4 Max/M3 Ultra = 4 months, 4 days
Anytime they've released a Max chip in a MBP and followed it up with a refresh for the Studio with that same Max chip, it's been about 4-5 months. WWDC would be closer to 3 months. I suppose there could be a late June/early July press release since it's just supposed to be a chip upgrade, no design changes.
Of course, Apple's free to change their timing on things whenever they please,
but I just don't see an October release for a Studio. I think if we don't see one by Summer, it'll be pushed to 2027, but both October and 2027 seem crazy far away when it seems like inventory of the Studio is drying up now. So to me, WWDC is looking really good.
While I do not know the exact mechanisms, the fact that it does is obvious, is it not? If it weren't improving performance, why do it at all?@DrWojtek @tenthousandthings I don't see how chiplets can have any part in improving performance. What am I missing?
Smaller dies give better yields. Getting more functional dies from a given wafer and lowering the overall cost per die.While I do not know the exact mechanisms, the fact that it does is obvious, is it not? If it weren't improving performance, why do it at all?
I wrote this for a different thread, but it applies here:The M1-3 Ultra is two dies and had one . The gap between the dies is extremely less than a mm. To put into proper context here is Apple's picture of the M1 Ultra package.
![]()
![]()
Apple unveils M1 Ultra, the world’s most powerful chip for a personal computer
Apple today announced M1 Ultra, the next giant leap for Apple silicon and the Mac.www.apple.com
The use of a horizontal layout inside the IC is probably confirmed now, since the original rumor from Ming-chi Kuo (screen shot of X post from December 2024 attached, now confirmed to have been accurate with regard to SoIC, at least) specified something called "SoIC-mH" (where "H" = horizontal) -- unlike Gurman, Kuo is more rigorous and does not mix-and-match things that he hears. So I think it's a safe bet.The next generation TSMC SoIC technolgy comes in two forms.. Either some dies layered on top. That is even less possible to posit two different spreaders (and even more thermal coupling between the dies). The other is horizontally like the older technique used in the Ultras so far. (there is a small 'bridge' die and the two are stacked on top of that. ). CPU and GPU split very heavily points to the horizontal set up.

The only thing obvious to me is that chiplets are a technology to improve yields and flexibility, not performance, that's why I'm asking.While I do not know the exact mechanisms, the fact that it does is obvious, is it not? If it weren't improving performance, why do it at all?
~
I wrote this for a different thread, but it applies here:
The hope, as I see it, without any expertise other than having good reading-comprehension skills and a bit of time on my hands, is that Apple could introduce an Ultra-only secondary SoIC chip (to be paired with a standard Max) that replaces the CPU die with a second GPU die. To my mind, that fits Apple's criteria -- it doesn't require much additional R&D other than with regard to UltraFusion's local ("L") silicon interconnect, which is presumably already part of the Ultra budget.
This theory assumes that the CPU and GPU dies are similar in size, which I don't think is something we know at this point.
The use of a horizontal layout inside the IC is probably confirmed now, since the original rumor from Ming-chi Kuo (screen shot of X post from December 2024 attached, now confirmed to have been accurate with regard to SoIC, at least) specified something called "SoIC-mH" (where "H" = horizontal) -- unlike Gurman, Kuo is more rigorous and does not mix-and-match things that he hears. So I think it's a safe bet.
The only thing obvious to me is that chiplets are a technology to improve yields and flexibility, not performance, that's why I'm asking.
Off the top of my head and not being specific:
* Separation of ’tiles’/chiplets -> lower temperatures -> higher performance potential
* Higher yield rate due to smaller chips may allow for more complex, better architechture/design, which lower yields, but since yields are up due to chiplets, it evens out
* Same argument but regards to new fab tech -> Apple might be more willing to try bleeding edge nodes, since chips are smaller -> higher performance
* Separation of chiplets may allow for faster architechture designs - for example cache and RAM could become more easily placeable and accessable -> higher performance
Thanks!chiplets intercommunication doesn't lower power consumption. It is higher. Can offset that by using smaller connections at absolute minimal distances. but the style that AMD has used on Ryzen/Epyc for first couple of generations isn't lowering. [ Laptop in those generations there are no chiplets. ]
Errr. more so chiplets allow optoin to use two different fab processes. One more dense that is kept smaller ( so can go more complex) and other better sutited to something that doesn't require maximum desne (like I/O ). Most of the time something that is 2-3 years mature has higher yields than something that is 1 (or less) years old. It is more so when you can use it more so than the more complex is worse (forever).
doing a 'complex' which is really a mismatch with the fab process design kit is going to get you lower yields. However, that is more a learning curve issue. In general "better architecture/design" should at least be in part , fewer mistakes. The most Rube Goldberg complex microarchitecture probably isn't 'better'.
bleeding edge usually comes with relatively lower yields . What limiting here is the impact of the defects to a smaller area. ( so that may be able to salvage the die to something useful. ). The defects are not disappearing. What you are doing with chiplets is limited the scope of impact ( less collateral damage to stuff that didn't have the defect).
Stacking cache on a CPU cihplet isn't 'easier'. But it can cut down the route to where the data has to go with blowing out the reticle limit.
There would still be a CPU die, in the first Max. You would only drop the CPU in the second Max, replacing it with another GPU.Two 'GPU' dies ends up with zero CPUs. How is that going to be a useful "Ultra'. The CPU dies likely contains the I/O (thunderbolt / usb ) , the SSD , Security enclave . If drop those also how useful a Mac will you have?
I’m not sure I understand you. TSMC says plainly that SoIC (which Apple is calling Fusion Architecture SoC) can still be interconnected via advanced packaging like InFO-LSI (UltraFusion). They (Fusion and UltraFusion) aren’t the same thing. As I understand it, InFO-LSI doesn’t require symmetry — the interconnected units don’t have to be identical — so it seems to me it could be used to do what I’m describing.Their New fusion is "two dies" joined. That makes it very likely that that each of these only one one "fusion edge". ( Just like the 'twin' Max die Ultras packages. ) […]
There would still be a CPU die, in the first Max. You would only drop the CPU in the second Max, replacing it with another GPU.
The classic Ultra (2x Max) would still be an option, but there would be an option for an Ultra configuration with 1x CPU and 3x GPU. That’s all I’m suggesting.
I’m not sure I understand you. TSMC says plainly that SoIC (which Apple is calling Fusion Architecture SoC) can still be interconnected via advanced packaging like InFO-LSI (UltraFusion). They (Fusion and UltraFusion) aren’t the same thing. As I understand it, InFO-LSI doesn’t require symmetry — the interconnected units don’t have to be identical — so it seems to me it could be used to do what I’m describing.
I can't believe we don't even know the sizes of the SLC.I can’t believe we still don’t have any M5 Pro or M5 Max die image…
I’m not aware of a useful M4 Pro or M4 Max die image, let alone M5…I can’t believe we still don’t have any M5 Pro or M5 Max die image…
eclecticlight.co
The M5 Pro and Max have 1 cluster with 6 S-cores and 16MB shared L2, and 2 clusters with 6 P-cores and 8MB shared L2 each. Total is 3 clusters, 18 cores and 32MB of L2. We still don't seem to know anything about the SLC.Have we gotten any cache size information yet?