A few days ago, I switched to the MLX models supported by LMStudio and noticed a nice speed improvement over GGUF models. Recently, with the addition of speculative decoding in the LMStudio beta, things have gotten even better: I'm seeing a 20 to 50% speed gain!
If you're not familiar, speculative decoding is a technique that speeds up inference for a larger model by pairing it with a faster, smaller model called the "draft." The draft model proposes several tokens cheaply, and the main model verifies them in a single pass, keeping only the ones it agrees with. Because every accepted token is one the main model would have produced anyway, quality isn't compromised compared to running the main model alone. Pretty cool, right?
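To make the idea concrete, here's a toy sketch of the greedy variant of speculative decoding. The "models" are just stand-in functions I made up for illustration (real implementations like LMStudio's work with actual LLM logits, and the full algorithm uses probabilistic acceptance rather than exact matching), but the structure is the same: the draft proposes a run of tokens, the target checks them, and the output is guaranteed to match what the target would have generated on its own.

```python
# Toy sketch of greedy speculative decoding. The two "models" below are
# hypothetical deterministic functions, not real LLMs.

def target_next(ctx):
    # Stand-in for the large, high-quality model's greedy next token.
    return (sum(ctx) + len(ctx)) % 7

def draft_next(ctx):
    # Stand-in for the small, fast draft model: usually agrees with the
    # target, but is occasionally wrong.
    s = sum(ctx) + len(ctx)
    return s % 7 if s % 5 else (s + 1) % 7

def speculative_step(ctx, k=4):
    """Draft proposes k tokens; the target verifies them.

    Accepted tokens are exactly the target's own greedy choices, so the
    final output is identical to running the target alone -- the speedup
    comes from verifying a whole run of drafted tokens in one target pass.
    """
    # 1. Draft model speculates k tokens ahead (cheap).
    proposed, dctx = [], list(ctx)
    for _ in range(k):
        t = draft_next(dctx)
        proposed.append(t)
        dctx.append(t)
    # 2. Target model verifies the proposals (one pass in a real system).
    accepted, tctx = [], list(ctx)
    for t in proposed:
        correct = target_next(tctx)
        if t == correct:
            accepted.append(t)      # draft guessed right: keep it
            tctx.append(t)
        else:
            accepted.append(correct)  # first mismatch: use the target's
            tctx.append(correct)      # token instead, then stop
            break
    return accepted

def speculative_decode(prompt, n, k=4):
    """Generate n tokens after the prompt using speculative steps."""
    ctx = list(prompt)
    while len(ctx) < len(prompt) + n:
        ctx.extend(speculative_step(ctx, k))
    return ctx[:len(prompt) + n]
```

By construction, `speculative_decode` produces exactly the same tokens as greedily decoding with `target_next` alone; the only thing that changes is how many tokens the "expensive" model has to verify per step, which is where the 20 to 50% speedup comes from when the draft agrees often.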