I benchmarked local voice-cloning models across English, German, Modern Standard Arabic, Spanish, and Mandarin Chinese.

Models:

OmniVoice int8

Chatterbox Multilingual fp16

VoxCPM2 bf16