Top 5 This Week

Related Posts

Microsoft-backed Mistral launches European AI cloud to compete with AWS and Azure


Join the event trusted by enterprise leaders for nearly two decades. VB Transform brings together the people building real enterprise AI strategy.ย Learn more


Mistral AI, the French artificial intelligence startup, announced Wednesday a sweeping expansion into AI infrastructure that positions the company as Europeโ€™s answer to American cloud computing giants, while simultaneously unveiling new reasoning models that rival OpenAIโ€™s most advanced systems.

The Paris-based company revealed Mistral Compute, a comprehensive AI infrastructure platform built in partnership with Nvidia, designed to give European enterprises and governments an alternative to relying on U.S.-based cloud providers like Amazon Web Services, Microsoft Azure, and Google Cloud. The move represents a significant strategic shift for Mistral from purely developing AI models to controlling the entire technology stack.

โ€œThis move into AI infrastructure marks a transformative step for Mistral AI, as it allows us to address a critical vertical of the AI value chain,โ€ said Arthur Mensch, CEO and co-founder of Mistral AI. โ€œWith this shift comes the responsibility to ensure that our solutions not only drive innovation and AI adoption, but also uphold Europeโ€™s technological autonomy and contribute to its sustainability leadership.โ€

How Mistral built reasoning models that think in any language

Alongside the infrastructure announcement, Mistral unveiled its Magistral series of reasoning models โ€” AI systems capable of step-by-step logical thinking similar to OpenAIโ€™s o1 model and Chinaโ€™s DeepSeek R1. But Guillaume Lample, Mistralโ€™s chief scientist, says the companyโ€™s approach differs from competitors in crucial ways.

โ€œWe did everything from scratch, basically because we wanted to learn the expertise we have, like, flexibility in what we do,โ€ Lample told me in an exclusive interview. โ€œWe actually managed to be, like, a really, very efficient on the stronger online reinforcement learning pipeline.โ€

Unlike competitors that often hide their reasoning processes, Mistralโ€™s models display their full chain of thought to users โ€” and crucially, in the userโ€™s native language rather than defaulting to English. โ€œHere we have like the full chain of thought which is given to the user, but in their own language, so they can actually read through it, see if it makes sense,โ€ Lample explained.

The company released two versions: Magistral Small, a 24-billion parameter open-source model, and Magistral Medium, a more powerful proprietary system available through Mistralโ€™s API.

Why Mistralโ€™s AI models gained unexpected superpowers during training

The models demonstrated surprising capabilities that emerged during training. Most notably, Magistral Medium retained multimodal reasoning abilities โ€” the capacity to analyze images โ€” even though the training process focused solely on text-based mathematical and coding problems.

โ€œSomething we realized, not exactly by mistake, but something we absolutely did not expect, is that if at the end of the reinforcement learning training, you plug back the initial vision encoder, then you suddenly, kind of out of nowhere, see the model being able to do reasoning over images,โ€ Lample said.

The models also gained sophisticated function-calling abilities, automatically performing multi-step internet searches and code execution to answer complex queries. โ€œWhat you will see is a model doing this, thinking, then realizing, okay, this information might be updated. Let me do like a web search,โ€ Lample explained. โ€œIt will search on like internet, and then it will actually pass the results, and it will result over it, and it will say, maybe, maybe the answer is not in this results. Let me search again.โ€

This behavior emerged naturally without specific training. โ€œItโ€™s something that whether or not on things to do next, but we found that itโ€™s actually happening kind of naturally. So it was a very nice surprise for us,โ€ Lample noted.

The engineering breakthrough that makes Mistralโ€™s training faster than competitors

Mistralโ€™s technical team overcame significant engineering challenges to create what Lample describes as a breakthrough in training infrastructure. The company developed a system for โ€œonline reinforcement learningโ€ that allows AI models to continuously improve while generating responses, rather than relying on pre-existing training data.

The key innovation involved synchronizing model updates across hundreds of graphics processing units (GPUs) in real-time. โ€œWhat we did is that we found a way to just unscrew the model through GPUs. I mean, from GPU to GPU,โ€ Lample explained. This allows the system to update model weights across different GPU clusters within seconds rather than the hours typically required.

โ€œThere is no like open source infrastructure that will do this properly,โ€ Lample noted. โ€œTypically, there are a lot of like open source attempts to do this, but itโ€™s extremely slow. Here, we focused a lot on the efficiency.โ€

The training process proved much faster and cheaper than traditional pre-training. โ€œIt was much cheaper than regular pre training. Pre training is something that would take weeks or months on other GPUs. Here, we are nowhere close to this. It was like, I depend on how many people we put on this. But it was more like, it was like, fairly less than one week,โ€ Lample said.

Nvidia commits 18,000 chips to European AI independence

The Mistral Compute platform will run on 18,000 of Nvidiaโ€™s newest Grace Blackwell chips, housed initially in a data center in Essonne, France, with plans for expansion across Europe. Nvidia CEO Jensen Huang described the partnership as crucial for European technological independence.

โ€œEvery country should build AI for their own nation, in their nation,โ€ Huang said at a joint announcement in Paris. โ€œWith Mistral AI, we are developing models and AI factories that serve as sovereign platforms for enterprises across Europe to scale intelligence across industries.โ€

Huang projected that Europeโ€™s AI computing capacity would increase tenfold over the next two years, with more than 20 โ€œAI factoriesโ€ planned across the continent. Several of these facilities will have more than a gigawatt of capacity, potentially ranking among the worldโ€™s largest data centers.

The partnership extends beyond infrastructure to include Nvidiaโ€™s work with other European AI companies and Perplexity, the search company, to develop reasoning models in various European languages where training data is often limited.

How Mistral plans to solve AIโ€™s environmental and sovereignty problems

Mistral Compute addresses two major concerns about AI development: environmental impact and data sovereignty. The platform ensures that European customers can keep their information within EU borders and under European jurisdiction.

The company has partnered with Franceโ€™s national agency for ecological transition and Carbone 4, a leading climate consultancy, to assess and minimize the carbon footprint of its AI models throughout their lifecycle. Mistral plans to power its data centers with decarbonized energy sources.

โ€œBy choosing Europe for the location of our sites, we give ourselves the ability to benefit from largely decarbonized energy sources,โ€ the company stated in its announcement.

Speed advantage gives Mistralโ€™s reasoning models practical edge

Early testing suggests Mistralโ€™s reasoning models deliver competitive performance while addressing a common criticism of existing systems โ€” speed. Current reasoning models from OpenAI and others can take minutes to respond to complex queries, limiting their practical utility.

โ€œOne of the things that people usually donโ€™t like about this reasoning model is that even though itโ€™s smart, sometimes itโ€™s taking a lot of time,โ€ Lample noted. โ€œHere you really see the output in just a few seconds, sometimes less than five seconds, sometimes even less than this. And it changes the experience.โ€

The speed advantage could prove crucial for business adoption, where waiting minutes for AI responses creates workflow bottlenecks.

What Mistralโ€™s infrastructure bet means for global AI competition

Mistralโ€™s move into infrastructure puts it in direct competition with technology giants that have dominated the cloud computing market. Amazon Web Services, Microsoft Azure, and Google Cloud currently control the majority of cloud infrastructure globally, while newer players like CoreWeave have gained ground specifically in AI workloads.

The companyโ€™s approach differs from competitors by offering a complete, vertically integrated solution โ€” from hardware infrastructure to AI models to software services. This includes Mistral AI Studio for developers, Le Chat for enterprise productivity, and Mistral Code for programming assistance.

Industry analysts see Mistralโ€™s strategy as part of a broader trend toward regional AI development. โ€œEurope urgently needs to scale up its AI infrastructure if it wants to stay competitive globally,โ€ Huang observed, echoing concerns voiced by European policymakers.

The announcement comes as European governments increasingly worry about their dependence on American technology companies for critical AI infrastructure. The European Union has committed โ‚ฌ20 billion to building AI โ€œgigafactoriesโ€ across the continent, and Mistralโ€™s partnership with Nvidia could help accelerate those plans.

Mistralโ€™s dual announcement of infrastructure and model capabilities signals the companyโ€™s ambition to become a comprehensive AI platform rather than just another model provider. With backing from Microsoft and other investors, the company has raised over $1 billion and continues to seek additional funding to support its expanded scope.

But Lample sees even bigger possibilities ahead for reasoning models. โ€œI think when I look at the progress internally, and I think on some benchmarks, the model was getting a plus 5% accuracy every week for like, maybe like, six weeks in all,โ€ he said. โ€œSo it itโ€™s improving very fast on, there are many, many, I mean, ton of tons of like, you know, small ideas that you can think of that will improve the performance.โ€

The success of this European challenge to American AI dominance may ultimately depend on whether customers value sovereignty and sustainability enough to switch from established providers. For now, at least, they have a choice.

#Microsoftbacked #Mistral #launches #European #cloud #compete #AWS #Azure
source: https://venturebeat.com/ai/microsoft-backed-mistral-launches-european-ai-cloud-to-compete-with-aws-and-azure/

LEAVE A REPLY

Please enter your comment!
Please enter your name here

Popular Articles