I’ve just received an update saying my Studio has been dispatched, and I have a UPS tracking link. The link shows it being in Shenzhen, China on the 10th, and it’s on the way (Chek Lap Kok, Hong Kong) as of a moment ago. The estimated delivery is shown as Thursday, 13th (by the end of the day).
Models larger than 70b or so are likely not going to be usefully fast. They will run, but you'll get a couple of tokens per second. So if you just want to start a run and go have lunch, then maybe you'll use the big models, but otherwise probably not.
And that is why I'll likely skip the M3 Ultra because the extra price for 256GB or 512GB still won't deliver useful speed for LLMs.
That was one of my deciding factors against going for the M3U with 256GB (okay, and the massive price! 🤣 ). I’ve been watching Alex Ziskind on YouTube and he’s done comparisons - including with 128GB M4 Max on a MBP. While you can load up 70B models, they’re not always going to be all that quick. Tolerable, acceptable even, but if you want speed, probably not so much.
Seems to me there’s a law of diminishing returns with Macs. More RAM will enable you to load larger models, but speed will suffer. I think it’ll probably do fine with some 34B models I’ve downloaded.
I went with the 128GB model because:
(i) I will
occasionally run 70B models (just because I can);
(ii) I’ll run smaller models, alongside Stable Diffusion for image generation, and sometimes also have Parallels running Windows 11 for small projects. This, combined, is all too much for my humble 24GB MBP;
(iii) I want to run LLMs with larger context lengths (I’d like it to summarise full scenes/chapters of the story I’m writing without me having to break them into smaller “acts”);
(iv) I’ve been playing about with ComfyUI for video generation with WAN and, on my 24GB MBP, it sucks up swap space like it’s going out of fashion - like it’s trying to use 60GB (RAM + 40GB SWAP) or something. It does work on the MBP, but it kind of takes over a full day for a 5s clip. I’d like to continue experimenting with things like this without tying up my entire machine for a day each time.
I think 128GB RAM will help towards all of the above.