ChatGPT’s new features are making headlines again as OpenAI introduces voice conversations and image input. OpenAI, a San Francisco-based artificial intelligence research laboratory, published its notes on the ChatGPT updates, which will allow users to communicate with the AI platform on a much more personal level. While users already have considerable experience with voice assistants like Siri, Alexa, and Cortana, the ChatGPT speech updates combine its natural language processing (NLP) abilities with a voice interface to make information more accessible than ever before. The OpenAI ChatGPT upgrade will also allow users to send pictures to ChatGPT to better explain their questions to the AI. Here’s what we know so far.
ChatGPT: New Features and Speech Updates
The OpenAI ChatGPT upgrades have been shifting the AI game with each new iteration. Earlier this year, the company announced that it was set to integrate the DALL-E 3 image-generation features into ChatGPT, to make it easier for users to formulate their prompts and generate unique images. The latest round of ChatGPT updates adds to the excitement, especially for Plus and Enterprise users, who should have access to the new features over the next two weeks.
The ChatGPT speech updates let users of the iOS and Android apps enable voice conversations through the settings page in order to talk to the AI. To illustrate possible uses, OpenAI’s website suggests asking it to tell you a bedtime story or to contribute to a dinner debate. ChatGPT’s new features should allow users to select from five different voices (Juniper, Sky, Cove, Ember, and Breeze), giving them the option to customize it to their preferences.
OpenAI also reports that the ChatGPT speech updates are powered by a new text-to-speech model that can generate human-like audio from just text and a few seconds of sample speech. ChatGPT’s new features also make use of the open-source Whisper system for speech recognition, which converts spoken audio into text.
Image Input: Another One of ChatGPT’s New Features
In addition to the ChatGPT speech updates, the AI tool will also be able to take image inputs and generate responses to them. These ChatGPT updates will allow customers to upload one or more images to the platform and ask it to interpret or analyze what the pictures show.
According to OpenAI, this tool can be useful in everyday situations, as it will allow you to “explore the contents of your fridge to plan a meal” or “analyze a complex graph for work-related data.” The website highlights how someone could use the OpenAI ChatGPT upgrade to find out how to lower a bike seat by showing the AI a picture of the bike in question. Not only will the AI be able to provide them with instructions, but it will also be able to look at a picture of the tools they have and tell them which ones to use. This presents a fascinating upgrade in the world of AI.
OpenAI ChatGPT Upgrade Cautions against Misuse
ChatGPT’s new features open the door to a variety of uses; however, OpenAI acknowledges the significant potential for misuse. The voice technology the company uses requires very little input to imitate real speech and could be gravely misappropriated for malicious activities. Earlier this year, a Vice article reported how its writer was able to trick a bank’s voice authentication security system by using AI to clone a voice. Generative AI of this kind comes with a host of risks. To restrict the technology to a specific use case until safeguards can be put in place, the company has limited it to voice chat, using voices created with voice actors who have collaborated with the company.
As for the vision-based model that backs the ChatGPT updates, OpenAI has been cautious, testing it with red teamers and alpha testers before release.
Other Collaborations with ChatGPT’s New Features
The voice technology behind the OpenAI ChatGPT upgrade has been reported to support Spotify’s growing investments in its podcast offerings. The tool should allow podcasters to translate their content into other languages more easily while using their own voice, thus increasing their reach. OpenAI also reported on its collaboration with Danish startup Be My Eyes and the development of the startup’s virtual volunteer with the support of GPT-4. OpenAI says its approach to vision-based services has been shaped greatly by the input it received through this collaboration.
The new ChatGPT features are an exciting advancement in technology and could significantly simplify lives if the AI works as well as reported. Plus and Enterprise customers should soon be able to share their experience of using the features that arrive with the ChatGPT updates, helping to inform OpenAI of the direction it should take next.