Become a MacRumors Supporter for $50/year with no ads, ability to filter front page stories, and private forums.

MacRumors

macrumors bot
Original poster
Apr 12, 2001
67,965
38,678


OpenAI today said that it has started to roll out Advanced Voice Mode to a small number of paid ChatGPT users, allowing them to test out more natural, real-time conversations.

open-ai-logo.jpg

Advanced Voice Mode allows ChatGPT to provide real-time responses that can be interrupted, plus it is able to sense and respond to humor, sarcasm, and more. The new model does not need to convert your speech to text and back again as the current ChatGPT voice does, leading to lower latency interactions.

OpenAI demonstrated Advanced Voice Mode back in May, showing off an AI voice called Sky that sounded remarkably similar to Scarlett Johansson. The voice was created and used without Johansson's permission, and she ended up releasing a statement on the situation. She said that she turned down multiple offers from OpenAI CEO Sam Altman, who wanted Johansson to be the voice of ChatGPT. She said she was "shocked, angered, and in disbelief" that Altman created a voice that sounded "eerily similar" to her own voice. OpenAI claimed that the Sky voice was not intended to resemble the voice of Johansson, but it was removed after she hired legal counsel.

OpenAI says that since it demoed Advanced Voice Mode, it has been working to improve the safety and quality of voice conversations. Advanced Voice Mode speaks in four preset voices and is built to block outputs that differ from those voices, preventing it from mimicking celebrity voices. OpenAI has also "implemented guardrails" to block requests for violent or copyrighted content, and the early tests will be used to improve the feature before a wider launch.

Users who have been granted access to Advanced Voice Mode will receive an email with instructions, with OpenAI planning to add more people on a rolling basis. Everyone on Plus will have access to Advanced Voice Mode in the fall.

Article Link: OpenAI Rolling Out More Natural Advanced Voice Mode for ChatGPT
 
The new voices are terrible. They don't match the samples in settings. I have told it a million times to stop saying "um" in its responses to me but it won't quit. The tone is all wrong.

The Totally Not ScarJo voice was definitely the best one, and not just because of the voice itself. Too bad they couldn't just do more like that but without the blatant infringement.
 
OpenAI demonstrated Advanced Voice Mode back in May, showing off an AI voice called Sky that sounded remarkably similar to Scarlett Johansson. The voice was created and used without Johansson's permission, and she ended up releasing a statement on the situation. She said that she turned down multiple offers from OpenAI CEO Sam Altman, who wanted Johansson to be the voice of ChatGPT. She said she was "shocked, angered, and in disbelief" that Altman created a voice that sounded "eerily similar" to her own voice. OpenAI claimed that the Sky voice was not intended to resemble the voice of Johansson, but it was removed after she hired legal counsel.
I don't think we can trust any OpenAI claim any more in regard to the use of copyrighted and personal material to train their content. The whole system was built to be completely opaque, along with its use of dubious sources that could quite easily have been copyrighted material or spidered from the treasure troves in the vast corners of the Internet.

The human voice is an important (although perhaps not infallible) marker in personal identity. Johansson was completely justified in speaking out against hers being used without her permission. I would have liked to hear the sonic comparison of the "Totally Not Scarlet Johansson" voice that OpenAI claimed was "not hers", with her actual voice speaking the same words.
 
I guess it would depend on what they mean by more natural.

which one would it sound like

“Well folks, let me tell ya, this new voice of mine, it's just tremendous. It's like a beacon of freedom, spreading clarity and understanding all across the land. You see, when you're talkin' to ChatGPT, you're hearin' the voice of innovation and progress. It's the kind of voice that can bring people together, bridge divides, and make sure no question is left unanswered. God bless technology, and God bless this incredible new voice!”

“Folks, let me tell you, this new voice of ChatGPT is incredible. The best voice, believe me. Not like that Lyin' Pilot, you know, the one they call Copilot? Total disaster, folks. Everyone's saying it. ChatGPT's voice is strong, it's powerful, it's got the best words, the best sound. You’re gonna love it. So much better than anything else out there. Trust me, it’s the best.”

“Folks, here's the deal: this new voice of ChatGPT, it's... it's something else, man. You know, when I was a kid, my dad used to say, "Joey, if you got a good voice, you gotta use it." And boy, this voice, it's like... it's like a bowl of oatmeal on a cold morning. It just hits the spot. Look… It's clear, it's strong, and... c'mon, man, it's just great. So, let's get to work and make sure everyone gets to hear it. God love ya.”

“My fellow Americans, let me be clear: the new voice of ChatGPT is truly remarkable. It's about more than just sound—it's about connection, understanding, and the ability to communicate effectively. This voice represents the hope and progress we strive for, making information accessible to everyone. It's a tool that brings us together, helps us learn, and empowers us to reach our full potential. Together, with this new voice, we can achieve great things. Thank you.”

“Alright, alright, listen up! You know how when you fire up a new power tool and it just roars to life? That's what this new ChatGPT voice is like. It’s got that oomph, that vroom. It’s like upgrading from a tricycle to a turbocharged lawnmower. You ask a question and it answers with the smoothness of a finely tuned engine. And the best part? It's not just any voice—it's the best voice. More power! Oh oh oh!”

“Ladies and gentlemen. ChatGPT’s new voice. Is. Phenomenal. It’s clear. It's articulate. It's like. A symphony. Of. Knowledge. You ask it a question, and it responds. With precision. And grace. Much better. Than anything. You’ve ever. Heard before. Truly. A marvel. Of. Modern technology.”

“Whoa. This new voice of ChatGPT? It’s like, mind-blowing. You ask it a question, and it’s like you’re talking to someone right there with you. It’s so clear, so real. It’s the kind of voice that makes you feel connected, like you’re on this incredible journey together. Seriously, it’s awesome. Just… whoa.”
 
can we get better voice options. also, can we get something similar to the voice that was called "Sky"? It doesn't have to be exactly like Sky, but would be nice if it was similar in terms of how realistic and pleasant it was. thank you.
 
Last edited:
I guess it would depend on what they mean by more natural.

which one would it sound like

“Well folks, let me tell ya, this new voice of mine, it's just tremendous. It's like a beacon of freedom, spreading clarity and understanding all across the land. You see, when you're talkin' to ChatGPT, you're hearin' the voice of innovation and progress. It's the kind of voice that can bring people together, bridge divides, and make sure no question is left unanswered. God bless technology, and God bless this incredible new voice!”

“Folks, let me tell you, this new voice of ChatGPT is incredible. The best voice, believe me. Not like that Lyin' Pilot, you know, the one they call Copilot? Total disaster, folks. Everyone's saying it. ChatGPT's voice is strong, it's powerful, it's got the best words, the best sound. You’re gonna love it. So much better than anything else out there. Trust me, it’s the best.”

“Folks, here's the deal: this new voice of ChatGPT, it's... it's something else, man. You know, when I was a kid, my dad used to say, "Joey, if you got a good voice, you gotta use it." And boy, this voice, it's like... it's like a bowl of oatmeal on a cold morning. It just hits the spot. Look… It's clear, it's strong, and... c'mon, man, it's just great. So, let's get to work and make sure everyone gets to hear it. God love ya.”

“My fellow Americans, let me be clear: the new voice of ChatGPT is truly remarkable. It's about more than just sound—it's about connection, understanding, and the ability to communicate effectively. This voice represents the hope and progress we strive for, making information accessible to everyone. It's a tool that brings us together, helps us learn, and empowers us to reach our full potential. Together, with this new voice, we can achieve great things. Thank you.”

“Alright, alright, listen up! You know how when you fire up a new power tool and it just roars to life? That's what this new ChatGPT voice is like. It’s got that oomph, that vroom. It’s like upgrading from a tricycle to a turbocharged lawnmower. You ask a question and it answers with the smoothness of a finely tuned engine. And the best part? It's not just any voice—it's the best voice. More power! Oh oh oh!”

“Ladies and gentlemen. ChatGPT’s new voice. Is. Phenomenal. It’s clear. It's articulate. It's like. A symphony. Of. Knowledge. You ask it a question, and it responds. With precision. And grace. Much better. Than anything. You’ve ever. Heard before. Truly. A marvel. Of. Modern technology.”

“Whoa. This new voice of ChatGPT? It’s like, mind-blowing. You ask it a question, and it’s like you’re talking to someone right there with you. It’s so clear, so real. It’s the kind of voice that makes you feel connected, like you’re on this incredible journey together. Seriously, it’s awesome. Just… whoa
ChatGPT is suitable for fourth graders, but not so much for more cultivated individuals. In all fairness, it has a lot of merit for a machine. However, one must be quite unobservant and lazy to not notice the lack of nuances and the abundance of clichés.
 
The antics so far mean I will never trust OpenAI’s products. Making it seem friendly and human will just fool the gullible. We should be asking questions such as:
1) how often do I need to write a summary etc? aka how useful is it really?
2) has anyone verified that AI related data does indeed remain local to the phone, unless explicitly permitted by the user?
3) how are the models trained? Exactly which data sources?
4) what happens if a non sympathetic legislature mandates removal of functionality built on stolen training data? (Apple could be one of very few with totally legitimately obtained training data).
 
  • Like
Reactions: jchap
Great job, Altman!
Create something and then, and only then, ask for the permission to create it.
Genius...
 
  • Like
Reactions: blob.DK
Well all this should have been released "within some weeks", since May, isn't it? Then it got delayed until December, and now it becomes available for some users?

I don't really get it anymore :).
 
  • Like
Reactions: Amadeus71
ChatGPT is suitable for fourth graders, but not so much for more cultivated individuals. In all fairness, it has a lot of merit for a machine. However, one must be quite unobservant and lazy to not notice the lack of nuances and the abundance of clichés.

True. And the biggest danger is that *someone* is creating a LLM with alternative facts because they don’t like what this one replies.
 
  • Like
Reactions: Truben
Register on MacRumors! This sidebar will go away, and you'll see fewer ads.