Become a MacRumors Supporter for $50/year with no ads, ability to filter front page stories, and private forums.

MacRumors

macrumors bot
Original poster


Apple reportedly plans to use next month's Worldwide Developers Conference (WWDC) to highlight its on-device AI capabilities as a competitive advantage, leaning on 15 years of custom silicon expertise to make the case for running AI models locally rather than in the cloud.

Apple-Silicon-AI-Optimized-Feature-Siri.jpg

People familiar with Apple's plans speaking to The Information say the company is expected to showcase how the chips designed for iPhones, Apple Watches, and Macs give it an edge in processing AI queries directly on devices. While cloud-based processing will remain necessary for complex queries, Apple will position local inference as a privacy-preserving, cost-saving alternative to the massive data center buildouts its rivals have pursued.

As part of its agreement with Google, Apple is apparently set to use a large version of Google's Gemini model to train a smaller, distilled version capable of running locally on Apple hardware. Apple is also said to be scouting acquisitions to help advance its model-shrinking work, with one company it has reportedly considered being Liquid AI, a Massachusetts startup focused on running AI locally on devices.

Some queries will still require cloud processing. Apple is believed to have approved the use of Nvidia's confidential compute technology within Google Cloud to handle processing of the larger Gemini-based model. The security feature encrypts data and AI models during processing, adding a modest performance cost but offering stronger privacy protections.

The arrangement represents a noticeable departure from Apple's original Apple Intelligence announcement, in which the company said all cloud-bound queries would be handled exclusively by its own Private Cloud Compute infrastructure running on Apple silicon. Apple is likely to retain the Private Cloud Compute branding despite the change, people familiar with the partnership told The Information.

There are also said to be material limits to how far Apple can push on-device processing. Google's full Gemini model runs into the trillions of parameters, and The Information claims that Apple has struggled to run it on its own Private Cloud Compute infrastructure, which uses the same Apple silicon chips found in Mac computers.

Apple Intelligence was first announced at WWDC 2024, but the rollout has been hampered by a tepid response to initial features and a protracted delay to the more personal version of Siri. Apple is now expected to use WWDC 2026, which runs from June 8 to reframe the narrative, reintroduce the delayed features, and debut new ones.

Article Link: Report: Apple Plans to Make On-Device AI a Key WWDC Focus
 
> As part of its agreement with Google, Apple is apparently set to use a large version of Google's Gemini model to train a smaller, distilled version capable of running locally on Apple hardware.

We already got this hints and you can try them now with Gemma4

I just hope local models will have proper APIs in OSes so you won't need to download same dataset multiple times and also that it won't be Apple-ish in the sense it only closed to their models and only being updated ever mac/i/PadOS /etc iteration
 
  • Love
Reactions: _Mitchan1999
what tasks can run on-device? I'm using AI pretty much only as a consumer for complex google queries. And I assume I'm using AI for some modest image editing and image search in my photo library. I'm currently not using any agents. The iOS uses it to search my emails/texts for contacts and appointments in the background. I can only see the iOS searches in my emails/texts running on device. The rest either needs up to date internet access or are to resource intensive.
 
AI tech is rapidly evolving. It's great that Apple is able to quickly pivot and offer AI tech that best meets customer needs while offering the best user privacy.

That said... I expect most people here will announce (multiple times over the next few months) they'll quickly turn it off as soon as it comes out, as has happened in the past.
 
  • Like
Reactions: Seanm87
> As part of its agreement with Google, Apple is apparently set to use a large version of Google's Gemini model to train a smaller, distilled version capable of running locally on Apple hardware.

We already got this hints and you can try them now with Gemma4

I just hope local models will have proper APIs in OSes so you won't need to download same dataset multiple times and also that it won't be Apple-ish in the sense it only closed to their models and only being updated ever mac/i/PadOS /etc iteration
I am pretty happy with Gemini Nano v2 on my ZF7 since I'm not much for asking AI questions about stuff when I can just google it. I just need something that I can make calls, appointments, respond/summarize email/texts etc. with hands free. Let's see how this stacks up to it, for all I know it is probably the same thing? I've also tried Gemini on the iPad and its system wide integration is terrible so hopefully they would improve on that as well.
 
Aside from comforting shareholders that Apple isn't completely missing the AI boat, I'm not sure who this is suppose to reassure.
 
I think the name “Siri” brings so much bad energy that they should do like Microsoft with Internet Explorer and rebrand it. Even if they make Siri work, everybody already has this idea that it sucks no matter what.
 
I’m going out on a limb and saying full featured on-device Apple Intelligence or other LLMs will not be featured on current gen Apple Watches. Now, if future watches have their System on Chip printed on 2nm fabs, hell, iPhone Air 2 could just be a screen that attaches to your watch via a short range 1-2meter Airplay connection 👍
 
> I just hope local models will have proper APIs in OSes so you won't need to download same dataset multiple times and also that it won't be Apple-ish in the sense it only closed to their models and only being updated ever mac/i/PadOS /etc iteration

Don't they already have local models available to developers?

 
  • Like
Reactions: neuropsychguy
I think the name “Siri” brings so much bad energy that they should do like Microsoft with Internet Explorer and rebrand it. Even if they make Siri work, everybody already has this idea that it sucks no matter what.

I agree. Not without precedent either when they dumped MobileMe in the transition to iCloud.
 
Register on MacRumors! This sidebar will go away, and you'll see fewer ads.