Ah I see your confusion. I wasn't suggesting that the logic dies would be vertically stacked - SoIC allows for both vertical and 2.5D interposer connects (as Apple already uses for its fusion connector).
Confused? You snipped out the leman quote that I was responding to which started off:
previous to that my comment was in response to tenthousandthings commentLooking at the wording from Apple patents describing stacked die solutions, a focus appears to be on increasing the effective transistor budget without increasing the package area.
... We can assume Apple’s priorities haven’t changed. ...
Apple reportedly clearly lays out their objective and I'm confused because there is a disconnect between their object and your horizontal expansion proposals? If the length and width of the package is pragmatically fixed, that would be a constraint that Apple didn't want to abandon. The Max package really can't get too much bigger in the MBP 14" logical board.
Macbook Pro 14" Late 2023 (M3 Pro and M3 Max) Logic Board Replacement
Use this guide to replace the logic board in your MacBook Pro 14" late 2023 (M3 Pro and M3 Max). Note that Touch ID will not function after...

The Mac Studio has similar issue with a "double Max" solution for the Ultra. The Ultra's package really can't much bigger.
The die size and the package size are different. The pins to the off-package I/O just tend to take up more space. (so there has been some slop in dies growing while package stays the same time. )
My post is more theoretical about what Apple *could* use SoIC-mh for in the Pro and Max line as Kuo and Ma have both released notes suggesting that will be implemented - Kuo specifically in the M5 Pro/Max. If my memory is right he specifically mentioned the 2.5D connector, not 3D. If so, vertical cache is less likely for the M5 Pro/Max. Ma's note on the subject was from awhile ago and if memory serves merely mentioned that Apple would explore using tile-like structures for its Pro/Max line of chips for the M5/M6. This could be wrong of course.
The 2.5D connector is going to buy increased 2-D transistor density how? Let's say more the "I/O" intense stuff off to a N4/N5 die. The 2D area that the I/O is taking up isn't shrinking there ( might be slightly bigger since the smaller fraction of logic in the I/O subsystem might have gotten bigger when backsliding on fab process). If take the "just as big" I/O die and "just as big" logic die and 'glue' them back together with a 2.5D process did the area go down?
If shooting for a radical increase in performance, then probably not. The logic tile/chiplet probably grew a bit even with improved process node ( just as the path Apple has been on with Max growing larger and larger. )
Perhaps if Apple did a N3X logic-only die and a N3P 'the rest" die then perhaps the 2D footprint area could do down. The I/O die isn't so much getting 'cheaper'. It just is more flexible ( stays on Fin-Flex while logic only tosses that flexibility to focus primarily on getting smaller (e.g., reverse the bloat from previous generations). ). To get an aggregate package transistor density increase, it gets hard to include dies that significantly backslide on transistor density and still get a net increase if no stack. The 'other' die(s) with the increase has to make up the backslide deficit before get any net increase.
Apple working with Broadcomm to do a server SoC in 2027 (M6 like time frame) . If it was Max and server were sharing and bigger gaps on I/O that makes more sense to me. But the Pro, SoIC applied to that keeping the costs the same seems like a stretch. (very similr to the Rube Goldberg Mn Extreme rumors that looked more and more expense the more folks tried to push that rock up a hill in rumors. )