Become a MacRumors Supporter for $50/year with no ads, ability to filter front page stories, and private forums.

DavidChoux

Suspended
Original poster
Jun 7, 2022
239
254
Just wanted to clarify how this new feature works.

I'm currently learning Chinese and I find often watch Youtube videos with hard coded subs (embedded into the video picture). As I understand with this new feature if I'm watching video on Youtube and want to know what a word is, I can just pause the video, enable live text but tapping something, and it'll recognise the text on the screen, allow me to copy it, and then I can simply google search it or put it into a dictionary that I have or whatever.

Is that correct?

Thanks.
 
if.the.developer.of.the.app.support.it.

I though it was an OS wide thing? Like it just does it on device on any app, like how screenshot works on any app. Or i guess that'a not how these things work, and developers have to 'allow' iOS to perform screenshots on their apps.
 
as far as I can see a developer has to implement wether live captions can be activated. Even Apple does this explicitly for each app, e.g. you have ”activate” this feature in FaceTime or RTT.

E803E738-8B27-4C4E-968C-5FCF6C9D4B6D.jpeg
 
It seems to be system wide? I tried to take a screenshot but it doesn't appear in them, so these poor quality photos will have to do: (Photos removed to avoid confusion)

Edit: Just realized you said Live Text not captions. That doesn't seem to be available.

Edit 2: I tested YouTube in Safari and it does support Live Text while paused. Another thing you can do system wide is simply take a screenshot and Live Text can be used from there.
 
Last edited:
Yeah sorry guys I mean Live Text, not captions.

But just to clarify what I mean.

Watch a YouTube video in Chinese with hard encoded subs (or overlayed subs for that matter) in a Chinese. When I come across a word in the subs that I want to know, I pause the video, it performs some kind optical recognition, recognises the Chinese characters and then allows me to copy that text to a dictionary or whatever.

So I'm pretty sure that's Live Text, right?
 
Yeah sorry guys I mean Live Text, not captions.

But just to clarify what I mean.

Watch a YouTube video in Chinese with hard encoded subs (or overlayed subs for that matter) in a Chinese. When I come across a word in the subs that I want to know, I pause the video, it performs some kind optical recognition, recognises the Chinese characters and then allows me to copy that text to a dictionary or whatever.

So I'm pretty sure that's Live Text, right?
Yes. And Live Text (not captions) works in Safari on the YouTube website for me but not in the YouTube app right now. Maybe they'll add support for it later. If not you can always take a screenshot and select text from there.
 
Yes. And Live Text (not captions) works in Safari on the YouTube website for me but not in the YouTube app right now. Maybe they'll add support for it later. If not you can always take a screenshot and select text from there.

ah right, yeah taking a screenshot is what I usually do, but it would be cool if it could happen without needing to do that. Hopefully the app will support that in the near future.
 
Register on MacRumors! This sidebar will go away, and you'll see fewer ads.