The Speech-02 series from MiniMax are text-to-speech models that let you create natural-sounding voices with emotional expression. The models have support for more than 30 languages.

According to the Artificial Analysis Speech Arena, Speech-02-HD is the best text-to-speech model available today, while Speech-02-Turbo comes in third.

With Replicate, you can run these models with one line of code.

Listen to MiniMax Speech-02

Here’s a sample of the Speech-02-HD model reading an adapted version of this blog post, and the prediction that generated it.