OpenAI is building a new voice model called GPT-Bidi-1 that could fundamentally change how ChatGPT handles spoken conversations. The key upgrade: the model can listen and speak simultaneously, rather than taking turns like a polite but slightly awkward dinner guest.
Code and UI elements related to GPT-Bidi-1 were discovered inside the ChatGPT app around June 16, 2026. No official announcement from OpenAI had been made as of June 23, 2026, but the internal preparations suggest the company is serious about making this happen.
What bidirectional audio actually means
In practical terms, this means the model can handle interruptions and modify its responses mid-sentence. If you cut ChatGPT off halfway through an answer to redirect the conversation, the model should be able to pivot without that jarring pause-and-restart that current voice modes suffer from.
The “Bidi” in the name stands for bidirectional, which is the core technical distinction. Current voice sessions predominantly use GPT-4o, a model that was never built from the ground up for real-time audio processing in both directions at once. GPT-Bidi-1 appears to be purpose-built for this exact use case.








