Mistral’s first open-weight TTS model Voxtral clones voices from three seconds of audio across nine languages

2026-03-26 10:16 GMT · 1 day ago aimagpro.com

French AI startup Mistral has released Voxtral TTS, its first text-to-speech model that supports nine languages and can clone voices from just three seconds of audio.
The article Mistral's first open-weight TTS model Voxtral clones voices from three seconds of audio across nine languages appeared first on The Decoder.