NEW YORK – Earlier this year, OpenAI amazed users by unveiling an enhanced voice mode for the latest version of ChatGPT. Unlike the robotic voices of digital assistants like Alexa or Siri, this new ChatGPT voice mode sounds incredibly lifelike. It can respond in real-time, handle interruptions, make giggling sounds when users joke, and assess a speaker’s emotional state based on their tone. During its initial demonstration, the voice even sounded strikingly similar to actress Scarlett Johansson’s.
Starting August 6, the advanced voice mode, compatible with ChatGPT-4, will begin its rollout to paid users. Initially, it will be available to a small group of subscribers in the app’s “Plus” mode, with plans to extend it to all Plus users by fall.
While ChatGPT already has a basic voice mode, this advanced version represents a significant leap forward. It could transform ChatGPT from a notable AI chatbot into a virtual personal assistant, enabling users to engage in natural, spoken conversations as they would with a friend. This ease of interaction could boost user engagement and present a challenge to existing virtual assistants from companies like Apple and Amazon.
However, the introduction of this advanced voice mode raises important questions: Will it consistently understand users, including those with speech differences? And will the human-like voice lead users to overly trust the AI, even when it makes mistakes?
OpenAI initially planned to launch the advanced voice mode in June but postponed it to ensure the tool’s safety and effectiveness. Over recent months, the company tested the AI’s voice capabilities with more than 100 testers from diverse linguistic and geographical backgrounds, covering 45 different languages across 29 regions.
To ensure safety, OpenAI has limited the voice mode to four pre-set options created in collaboration with voice actors, preventing impersonation. Additionally, it will block requests to generate music or other copyrighted audio and will include the same protections as ChatGPT’s text mode to avoid producing illegal or harmful content.
One major change from the May demo is the removal of the voice that many believed resembled Scarlett Johansson’s. Although OpenAI clarified that the voice was created with a different actor, it decided to pause its use out of respect after Johansson raised concerns.
The launch of ChatGPT’s advanced voice mode follows OpenAI’s recent announcement of testing a search engine powered by its AI technology. This new search engine could eventually challenge Google’s dominance in the online search market as OpenAI continues to expand its range of consumer-facing AI tools.
Copyright © 2024 The Daily Star. Provided by SyndiGate Media Inc. (Syndigate.info). All rights reserved.