Voicebox: The Open-Source AI Voice Studio That Just Hit 28K Stars

I've been watching the voice AI space for a while. ElevenLabs does voice cloning incredibly well. WisprFlow nails voice dictation. But both live in the cloud, both cost money every month, and both require uploading your voice data to someone else's server.

That's why Voicebox caught my attention. 28.5k GitHub stars, MIT license, and it runs entirely on your machine. It combines what ElevenLabs does (voice output) with what WisprFlow does (voice input), ties them together with a local LLM, and wraps everything in a polished desktop app.

Clone Your Voice in Seconds

The voice cloning takes seconds of reference audio. Upload a short clip, and Voicebox builds a voice model that sounds like you. It covers 23 languages — English, Chinese, Japanese, Arabic, Hindi, Swahili, and more.