Transcription, translation, summarization… Mistral launches audio-focused models

In addition to these capabilities, the first model, called Voxtral, will soon be able to recognize speakers' characteristics, such as their age or sex, and their emotions.

The French artificial intelligence (AI) start-up Mistral on Tuesday launched its first models focused on speech recognition and transcription in several languages.

This open source model, called Voxtral, can transcribe audio, whether live or from imported files, in several languages ranging from English to Hindi, which it recognizes automatically.

It can also summarize and respond to spoken requests, and Mistral intends to add other features, such as recognition of multiple speakers and their characteristics (age, sex) as well as their emotions, according to a press release.

Improving commercial voice systems

Voxtral can be used in particular to improve the voice systems that businesses use to respond to their customers by phone, according to the start-up. The French company is also developing, with the carmaker Stellantis, a system that lets drivers interact by voice with an AI assistant in their vehicle.

The American giant OpenAI, for its part, presented last year a voice mode for its GPT-4o model, capable of “reasoning” in real time across audio, vision and text. This version of ChatGPT can notably read users’ emotions from their faces through a smartphone camera.

The French artificial intelligence research laboratory Kyutai, founded by Xavier Niel, owner of the Iliad group, and Rodolphe Saadé, CEO of the shipping company CMA CGM, presented a simultaneous translation model in February. Called “Hibiki” (“echo” in Japanese), it translates a user’s speech from French into English in real time, as an interpreter would.

Author: KD with AFP
Source: BFM TV

Magdalena
