Storia in 1 fonti

New open-source voice model listens nonstop and decides every 0.4 seconds whether to speak or stay silent

Unlike GPT-4o or Qwen3.5-Omni, Audio Interaction doesn't wait for a recording to end: it translates, transcribes, chats, and picks up everyday noises like coughing in a single stream. Code, model weights, and download instructions are available on GitHub under the Apache 2.0 open-source license, with the training data to follow.

Raccontata da

the-decoder.com

Timeline cronologica

sabato 6 giugno 2026·the-decoder.com
New open-source voice model listens nonstop and decides every 0.4 seconds whether to speak or stay silent
Unlike GPT-4o or Qwen3.5-Omni, Audio Interaction doesn't wait for a recording to end: it translates, transcribes, chats, and picks up everyday noises like coughing in a single…