Stability AI launches Stable Audio 3.0 with up to six-minute tracks and open weights

Stability AI has unveiled Stable Audio 3.0, a new generation of audio models - three of which ship with open weights. The models generate music tracks up to six minutes long and were trained entirely on licensed data, according to the company.

mercoledì 20 maggio 2026 New tab

The model family includes four variants. Stable Audio 3.0 Small SFX and Stable Audio 3.0 Small each pack 459 million parameters and produce tracks up to two minutes long in 0.44 seconds of inference time on an H200 GPU. The first focuses on sound effects and is designed for smartphones and consumer laptops. The second targets short music pieces. Stable Audio 3.0 Medium runs 1.4 billion parameters and generates tracks up to 6:20 minutes in 1.31 seconds. All three are available as open-weights models on Hugging Face.

The largest model, Stable Audio 3.0 Large with 2.7 billion parameters, isn't available as open weights. It's only accessible through the Stability AI API, through partner fal.ai, or can be hosted on a company's own infrastructure via enterprise licensing. Stability AI says it delivers the highest musicality and is built for music platforms with high generation volume.

New architecture enables longer, more flexible audio output

Stable Audio 3.0 runs on a new architecture with a semantic-acoustic autoencoder that allows longer and more flexible audio output, according to Stability AI. Generation works at variable length with second-level control.

New architecture enables longer, more flexible audio output

Stability AI launches Stable Audio 3.0 with up to six-minute tracks and open weights

Stability AI launches Stable Audio 3.0 with up to six-minute tracks and open weights

Other newsrooms on this story

Related reading

Stability AI releases a new audio model that can create six-minute songs |…

Stability AI Launches Improved Music Models

Stability AI releases Stable Audio 3.0 with six-minute song generation…

ElevenLabs, Stability AI Drop New AI Music Models—Can They Catch Suno? - Decrypt

Stable Diffusion 3.5 is here – Replicate blog

ElevenLabs Music v2 promises opera-to-metal transitions without losing musical…

Other newsrooms on this story

Related reading

Stability AI releases a new audio model that can create six-minute songs |…

Stability AI Launches Improved Music Models

Stability AI releases Stable Audio 3.0 with six-minute song generation…

ElevenLabs, Stability AI Drop New AI Music Models—Can They Catch Suno? - Decrypt

Stable Diffusion 3.5 is here – Replicate blog

ElevenLabs Music v2 promises opera-to-metal transitions without losing musical…