How many Mac Studios would you need to connect that?
Up to a max of 4. Effectively 2TB of RAM, a 320 core GPU, 128 core CPU, 128 core NPU.
It's the shared RAM from using RDMA to create up to 2TB of vRAM that makes it compelling for enterprise local LLMs.
US$40k for that setup.
So not a consumer level feature or rig.
If you watch the youtube videos I linked to in my OP they'll point out how compelling that setup is to alternatives from NVidia etc in both price and power consumption.
The reason Apple doesn't update the "Ultra" series of M processors as regularly as the standard, Pro and Max will be due to lack of volume.
If a feature like locally hosted LLMs helps enterprise sales of Mac Studio Ultras, increasing that volume, it might enable Apple to justify more regular iterations of the Ultra series.
As I said in my OP it's all about memory and memory bandwidth at this point in time and that might be where the focus is on the M series roadmap.
The next iteration of M Pro/Max/Ultra might allow for more than 4 units in a RDMA cluster, or more likely, much higher bandwidth over RDMA between each unit which is the real limitation.