Progress in artificial intelligence (AI) shows no sign of slowing. OpenAI has now introduced voice and image features for its chatbot, ChatGPT.
ChatGPT can now see, hear, and speak. Rolling out over next two weeks, Plus users will be able to have voice conversations with ChatGPT (iOS & Android) and to include images in conversations (all platforms). https://t.co/uNZjgbR5Bm pic.twitter.com/paG0hMshXb
— OpenAI (@OpenAI) September 25, 2023
The company has upgraded ChatGPT's capabilities to comprehend both voice and images:
Speak with ChatGPT:
You can now use voice to engage in a back-and-forth conversation with your assistant.
The hyper-realistic text-to-speech model allows you to choose from five different voices.
On mobile, opt-in to voice in Settings → New Features on the mobile app. pic.twitter.com/8VwiLxghfP
— Rowan Cheung (@rowancheung) September 25, 2023
OpenAI is also working with other companies to put the new voice technology to use.
Spotify, in particular, is partnering with OpenAI to translate podcasts into more languages while keeping the podcasters' own voices.
Spotify collaboration:
The new text-to-speech model is already being used in Spotify's Voice Translation feature pilot.
AI-translated podcasts are coming to Spotify. pic.twitter.com/fsyUm0lJT6
— Rowan Cheung (@rowancheung) September 25, 2023
Image understanding in ChatGPT is powered by multimodal versions of GPT-3.5 and GPT-4. With this feature:
Users can upload images and ask questions about them, for example asking ChatGPT to identify the items in a picture for inventory purposes or to analyze a chart for a work presentation.
In the mobile app, you can also draw on the image to point ChatGPT at specific elements.
Chat with images:
ChatGPT's language reasoning skills can now understand images, photographs, screenshots, and documents containing text.
You can also discuss multiple images or use their new drawing tool to guide your assistant 🤯 pic.twitter.com/d4YZnhP0vr
— Rowan Cheung (@rowancheung) September 25, 2023
Plus and Enterprise users will get access to these features over the next two weeks, with developers to follow soon after.
Voice is an opt-in beta in the ChatGPT mobile apps (iOS and Android), while image input will be available to those users on all platforms.