ChatGPT, can now see, hear, and speak.

Monday, January 1, 2024

Written by Isaac Castillo


Since the inception of ChatGPT by OpenAI, it has rapidly become an indispensable tool in people’s daily lives. This transformative technology has undergone significant advancements, with OpenAI recently announcing groundbreaking features set to further enhance ChatGPT’s capabilities, most notably by venturing into the realm of multimodality.

OpenAI has embarked on a journey of amalgamating its various artificial intelligence solutions to empower ChatGPT with novel functionalities that cater to an even wider array of user needs.

ChatGPT

Engage in Conversations with ChatGPT

One of the most exciting updates in ChatGPT’s repertoire is its newfound ability to engage in voice conversations, much like popular virtual assistants such as Siri, Alexa, or Google Assistant. ChatGPT leverages the “Whisper” speech-to-text service, which seamlessly converts the user’s spoken words into text that the model can comprehend and respond to effectively.

ChatGPT Voice

Voice Synthesis Technology

In addition to its remarkable voice recognition capabilities, OpenAI has introduced a cutting-edge technology that facilitates the conversion of ChatGPT-generated text into a synthesized and remarkably realistic voice. While details about this technology remain somewhat limited, OpenAI has emphasized its commitment to addressing concerns related to voice plagiarism and privacy. Their overarching objective is to create an artificial general intelligence (AGI) that is not only highly capable but also safe and beneficial for all.

Juniper Voice:

Ember Voice:

Breeze Voice:

Chat with Images

The innovation doesn’t stop there. OpenAI has integrated visionary technology into ChatGPT, endowing it with the ability to interpret and converse about images. For instance, if you’re unsure about which tool to use for a particular task, you can simply display your tools to ChatGPT, and it will provide you with a detailed recommendation based on visual analysis.

ChatGPT Image

The continuous evolution of ChatGPT exemplifies OpenAI’s dedication to pushing the boundaries of artificial intelligence. With its foray into multimodality, including voice interactions and image recognition, ChatGPT is poised to become an even more invaluable companion in our daily lives. As OpenAI addresses concerns regarding privacy and the ethical use of these technologies, the future of ChatGPT appears promising, offering a glimpse into the potential of safe and beneficial artificial general intelligence.

You can try this new version of ChatGPT in few days at chat.openai.com.