ChatGPT continues to evolve, now with a significant voice upgrade: OpenAI has launched an update that integrates voice mode directly into the chat window and changes the way dialogues with the AI are conducted. Users can now hold a full voice conversation while viewing responses and visual content in real time, without switching to a separate screen.

Until now, voice mode ran only in a dedicated window that displayed an animated circle and showed no text or images, making it difficult to follow the conversation’s content. Now, users can speak, watch the response being written in real time, review previous messages, and examine images or maps sent during the conversation, all without interruption.

This innovation represents a fundamental shift in how users interact with the digital assistant. Instead of choosing in advance between a voice or text conversation, users can freely switch between the two within the same window. To talk, press record and start speaking. To return to typing, stop the recording and click end. The entire experience stays on the same screen, so the exchange feels more like a natural, close conversation than a constant switching between interfaces.

The new update comes a day after ChatGPT’s shopping feature was enhanced. The company integrated deep research capabilities into the shopping experience: the system now asks clarifying questions, scans diverse sources, and generates personalized purchase guides. The company appears to be continuing this momentum, simplifying the voice chat experience as part of a broader trend of unifying all AI capabilities into a single, efficient window.

This change addresses one of the main criticisms of the previous voice mode. Many users noted it was difficult to follow ChatGPT’s answers when they were spoken aloud without appearing as text. If a word or sentence was missed, there was no way to know what was said without leaving voice mode and returning to the text screen. Now, users can converse while still seeing every word on the screen, along with images, graphs, and maps in real time, which makes working, learning, and searching for information significantly easier.

The company notes that the new design will become the default across all platforms, both in the app and in browsers. Those who still prefer the old voice interface, based on a separate screen, can restore it in settings: under the voice mode option, a new button lets users select the separate mode. This preserves user choice while offering a more intuitive experience by default.

What is particularly interesting is how the two most recent updates, in both shopping and voice, indicate the company’s direction. The strategy seems to be to strengthen the sense of human interaction, make conversations smooth, and reduce technical decisions for users. The company promises to continue developing capabilities that unify voice, text, and visual content so that everything exists in a single coherent environment.