Eleven v3 is no longer in alpha, and is now generally available.

We're pleased to reveal Eleven v3 (alpha), the most expressive Text to Speech model.

This research preview brings unprecedented control and realism to speech generation with:

- 70+ languages
- Multi-speaker dialogue
- Audio tags like [excited], [whispers], and [sighs]

Eleven v3 (alpha) requires more prompt engineering than previous models, but the generations are breathtaking.

If you’re working on videos, audiobooks, or media tools, this unlocks a new level of expressiveness. For real-time and conversational use cases, we recommend staying with v2.5 Turbo or Flash for now. A real-time version of v3 is in development.

Eleven v3 is available today on our website and in the API.

Why we built v3

Since launching Multilingual v2, we’ve seen voice AI adopted in professional film, game development, education, and accessibility. But the consistent limitation wasn’t sound quality; it was expressiveness. More exaggerated emotions, conversational interruptions, and believable back-and-forth were difficult to achieve.

Eleven v3 addresses this gap. It was built from the ground up to deliver voices that sigh, whisper, laugh, and react, producing speech that feels genuinely responsive and alive.

What’s new in Eleven v3 (alpha)
Eleven v3: Most Expressive AI TTS Model Launched
Eleven v3 (alpha) introduces advanced audio tags, dialogue mode, and 70+ languages for nuanced, emotionally rich AI-generated speech.
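
To make the audio tag idea concrete, here is a minimal sketch of how tagged text might be assembled before it is sent for generation. It only builds a string: the helper names `tag` and `build_script` are hypothetical, the tag names are the three quoted in this post, and nothing here calls the actual API or reflects its request format.

```python
import re
from typing import Iterable, Optional, Tuple

# Tag names quoted in the announcement; the model may accept others.
EXAMPLE_TAGS = ("excited", "whispers", "sighs")


def tag(name: str, text: str) -> str:
    """Prefix text with an audio tag in square brackets (hypothetical helper)."""
    # Keep tag names to lowercase words so a stray bracket cannot break the prompt.
    if not re.fullmatch(r"[a-z]+( [a-z]+)*", name):
        raise ValueError(f"unexpected tag name: {name!r}")
    return f"[{name}] {text}"


def build_script(segments: Iterable[Tuple[Optional[str], str]]) -> str:
    """Join (tag_or_None, text) segments into a single prompt string."""
    parts = []
    for tag_name, text in segments:
        parts.append(tag(tag_name, text) if tag_name else text)
    return " ".join(parts)


script = build_script([
    ("excited", "We made it!"),
    (None, "I can't believe it."),
    ("sighs", "Finally."),
])
print(script)
```

Keeping the tags as data rather than typing them inline makes it easier to iterate on delivery, which matters given that v3 needs more prompt engineering than earlier models.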








