30.8 C
New York
Tuesday, July 15, 2025

Mistral releases Voxtral, its first open source AI audio model


As AI systems become more capable, speech is fast becoming the default way we communicate with machines. French AI startup Mistral has jumped into the audio race with its first open model, aiming to challenge the dominance of walled-off corporate systems with open-weight alternatives.  

On Tuesday, Mistral announced the release of Voxtral, its first family of audio models aimed at businesses.

The company is pitching Voxtral as the first open model that’s capable of deploying “truly usable speech intelligence in production.”

In other words, no longer will developers have to choose between a cheap, open system that fumbles transcriptions and doesn’t really understand what’s being said, and one that functions well, but is closed, leaving developers with a higher bill and less control over deployment. 

For businesses, that means Voxtral offers an affordable alternative that the company claims is “less than half the price” of comparable solutions.

Image Credits:Mistral

Mistral says Voxtral can transcribe up to 30 minutes of audio. Due to its LLM backbone, Mistral Small 3.1, it can understand up to 40 minutes, allowing users to ask questions about the audio content, generate summaries, or turn voice commands into real-time actions like calling APIs or running functions. Voxtral is also multilingual, with the ability to transcribe and understand languages including English, Spanish, French, Portuguese, Hindi, German, Dutch, and Italian.

The company is offering up two variants of its “speech understanding models”. The first, Voxtral Small, has 24B parameters for production-scale deployments, and is competitive with ElevenLabs Scribe, GPT-4o-mini, and Gemini 2.5 Flash. 

The second, Voxtral Mini, has 3 billion parameters for local and edge deployments. There’s also an ultra-cheap, stripped-down, fast API version of the 3B model called Voxtral Mini Transcribe that is optimized for transcription-only use cases and promises to outperform OpenAI Whisper for less than half the price.

Users can try Voxtral for free by downloading the API on Hugging Face or testing the models in Mistral’s chatbot Le Chat. Integrating the API into applications starts at $0.001 per minute, according to the company. 

The launch comes a month after Mistral announced Magistral, its first family of reasoning models that work through problems step-by-step for improved reliability. 

Mistral, one of the top AI firms in Europe, is well-known for its advocacy pushing open source AI models. Earlier this month, TechCrunch reported that the company is in talks to raise up to $1 billion in equity from investors like Abu Dhabi’s MGX fund.

Related Articles

LEAVE A REPLY

Please enter your comment!
Please enter your name here

Latest Articles