Mistral releases Voxtral, its first open source AI audio model

July 15, 2025

2

As AI systems become more capable, speech is fast becoming the default way we communicate with machines. French AI startup Mistral has jumped into the audio race with its first open model, aiming to challenge the dominance of walled-off corporate systems with open-weight alternatives.

On Tuesday, Mistral announced the release of Voxtral, its first family of audio models aimed at businesses.

The company is pitching Voxtral as the first open model that’s capable of deploying “truly usable speech intelligence in production.”

In other words, no longer will developers have to choose between a cheap, open system that fumbles transcriptions and doesn’t really understand what’s being said, and one that functions well, but is closed, leaving developers with a higher bill and less control over deployment.

For businesses, that means Voxtral offers an affordable alternative that the company claims is “less than half the price” of comparable solutions.

Mistral says Voxtral can transcribe up to 30 minutes of audio. Due to its LLM backbone, Mistral Small 3.1, it can understand up to 40 minutes, allowing users to ask questions about the audio content, generate summaries, or turn voice commands into real-time actions like calling APIs or running functions. Voxtral is also multilingual, with the ability to transcribe and understand languages including English, Spanish, French, Portuguese, Hindi, German, Dutch, and Italian.

The company is offering up two variants of its “speech understanding models”. The first, Voxtral Small, has 24B parameters for production-scale deployments, and is competitive with ElevenLabs Scribe, GPT-4o-mini, and Gemini 2.5 Flash.

The second, Voxtral Mini, has 3 billion parameters for local and edge deployments. There’s also an ultra-cheap, stripped-down, fast API version of the 3B model called Voxtral Mini Transcribe that is optimized for transcription-only use cases and promises to outperform OpenAI Whisper for less than half the price.

Users can try Voxtral for free by downloading the API on Hugging Face or testing the models in Mistral’s chatbot Le Chat. Integrating the API into applications starts at $0.001 per minute, according to the company.

The launch comes a month after Mistral announced Magistral, its first family of reasoning models that work through problems step-by-step for improved reliability.

Mistral, one of the top AI firms in Europe, is well-known for its advocacy pushing open source AI models. Earlier this month, TechCrunch reported that the company is in talks to raise up to $1 billion in equity from investors like Abu Dhabi’s MGX fund.

Previous articleSK Telecom Unveils Mobile AI Innovations with AX 3.1 Lite

Next articleDigitalization — Is It Time for Humans to Intervene?

Mistral releases Voxtral, its first open source AI audio model

Related Articles

It’s not you, it’s us: New report reveals why corporate-startup partnerships fall apart

Ruth Porat’s remarks at the Pennsylvania Energy & Innovation Summit

Meta fixes bug that could leak users’ AI prompts and generated content

LEAVE A REPLY Cancel reply

Latest Articles

It’s not you, it’s us: New report reveals why corporate-startup partnerships fall apart

Ruth Porat’s remarks at the Pennsylvania Energy & Innovation Summit

Meta fixes bug that could leak users’ AI prompts and generated content

CityFibre Secures £2.3B for UK Digital Expansion Boost

British startup Cryogenx raises €1.9 million to tackle deadly heat stress in military and industry

ABOUT US