I was talking about their H200 and soon B200 series, not gaming GPUs lol.
I have an Nvidia 4090 workstation. My M1 Max runs stuff the 4090 can only dream of. Nvidia needs to raise the 24 GB VRAM limit or lower the prices of GPUs with more than 40 GB. If Apple can provide 256 GB of unified memory, I can lower my cloud costs (already low with the M1 Max).
Apple needs to allow multiple Studios to be clustered so their GPUs and memory can be pooled.
H200 VRAM: up to 141 GB HBM3e @ 6.5 Gbps
And that's called unified RAM, not VRAM specifically: you have to spend that memory on everything else too, and it maxes out at 400 GB/s of bandwidth. I'm pretty sure that's even slower than the RTX 3090's VRAM (as expected ofc).
Apple is currently busy selling its last-generation $1800 laptop with 8 GB of RAM, which is less RAM than the majority of Android phones have. I don't think those glorious days will come anytime soon.
Sure, those GPUs are much more expensive, but the M2 Ultra is at like 31.6 TOPS, while the H200 is at 3900ish TOPS and soon the B200 will be at 20,000 TOPS.
Comparing the M2 Ultra to the H200 in AI is more like comparing a GT 710 to an RTX 4090 in gaming.
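For a rough sense of scale, here's a quick sketch of the ratios implied by the TOPS numbers quoted above. The figures are just the ones from this thread, taken at face value; they are probably not all quoted at the same precision or sparsity setting, so treat the result as ballpark only. The `tops` dict and `speedup_over` helper are made up for illustration.

```python
# TOPS figures as quoted in this thread (not independently verified, and
# likely not measured at the same precision/sparsity settings).
tops = {"M2 Ultra": 31.6, "H200": 3900.0, "B200": 20000.0}


def speedup_over(baseline: str, chip: str) -> float:
    """Return how many times more TOPS `chip` has than `baseline`."""
    return tops[chip] / tops[baseline]


for chip in ("H200", "B200"):
    print(f"{chip}: ~{speedup_over('M2 Ultra', chip):.0f}x the M2 Ultra")
```

On those numbers the H200 comes out at roughly 123x and the B200 at roughly 633x the M2 Ultra.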