Become a MacRumors Supporter for $50/year with no ads, ability to filter front page stories, and private forums.

hehe299792458

macrumors 6502a
Original poster
Dec 13, 2008
783
3
Curious if the underlying transcription technology / engine has been changes for iOS 18? For example, with iOS 17 & before, the voice to text accuracy is less than OpenAI’s whisper model. How about for iOS 18? Better? Compared to Whisper?
 
Also very interested in this question if anyone has the iOS 18 or macOS Sequoia betas installed right now, please let us know what you're able to learn! I'm very curious to know what language model is powering the new transcription feature in the Notes app and the Voice Memos app (which I presume utilize the same ML frameworks across all Apple platforms). I'm finally seeing real-world examples of the transcription (screenshots from public beta videos attached) , but want to know what model is used and how it benchmarks against OpenAI's Whisper v3 (which consistently performs superbly).
 

Attachments

  • Screenshot 2024-07-23 at 4.01.50 PM.png
    Screenshot 2024-07-23 at 4.01.50 PM.png
    632 KB · Views: 38
  • Screenshot 2024-07-23 at 4.03.21 PM.png
    Screenshot 2024-07-23 at 4.03.21 PM.png
    1,014.1 KB · Views: 34
  • Screenshot 2024-07-23 at 4.03.01 PM.png
    Screenshot 2024-07-23 at 4.03.01 PM.png
    1.1 MB · Views: 25
Register on MacRumors! This sidebar will go away, and you'll see fewer ads.