Research Engineer, Machine Learning - Paris/London/Zurich/Warsaw

📍 Paris, FR 🌐 Remote/hybrid 🛂 Visa sponsor available Mid 4+ yrs
PyTorch, JAX, TensorFlow, DeepSpeed, FSDP, SLURM, Kubernetes, Python
TL;DR

Research Engineer at Mistral AI building large-scale ML training systems and pipelines for open-weight models. Work spans platform infrastructure and embedded research squads, optimizing distributed training on thousands of GPUs.


Job description

About Mistral 
 
At Mistral AI, we believe in the power of AI to simplify tasks, save time, and enhance learning and creativity. Our technology is designed to integrate seamlessly into daily working life.
 
We democratize AI through high-performance, optimized, open-source and cutting-edge models, products and solutions. Our comprehensive AI platform is designed to meet enterprise as well as personal needs. Our offerings include Le Chat, La Plateforme, Mistral Code and Mistral Compute - a suite that brings frontier intelligence to end-users.
 
We are a dynamic, collaborative team passionate about AI and its potential to transform society. Our diverse workforce thrives in competitive environments and is committed to driving innovation. Our teams are distributed between France, USA, UK, Germany and Singapore. We are creative, low-ego and team-spirited.
 
Join us to be part of a pioneering company shaping the future of AI. Together, we can make a meaningful impact. See more about our culture on https://mistral.ai/careers.
 
Mistral AI participates in the E-Verify program
 

By applying, you agree to our Applicant Privacy Policy.


Role Summary 
 
About the Research Engineering team
 
The team spans Platform (shared infra & clean code) and Embedded (inside research squads). Engineers can move along the research↔production spectrum as needs or interests evolve.
 
As a Research Engineer – ML track, you’ll build and optimise the large-scale learning systems that power our open-weight models. Working hand-in-hand with Research Scientists, you’ll join either:
 
- Platform RE Team: Enhance the shared training framework, data pipelines and cluster tooling used by every team; or
- Embedded RE Team: Sit inside a research squad (Alignment, Pre-training, Multimodal, Safety …) and turn fresh ideas into repeatable, scalable code.
 
 
What you will do
 
• Accelerate researchers by taking on the heavy parts of large-scale ML pipelines and building robust tools.
• Interface cutting-edge research with production: integrate checkpoints, streamline evaluation, and expose APIs.
• Conduct experiments on the latest deep-learning techniques (sparsified 70B+ runs, distributed training on thousands of GPUs).
• Design, implement and benchmark ML algorithms; write clear, efficient code in Python.
• Deliver prototypes that become production-grade components for Le Chat and our enterprise API.
 
About you
 
• Master’s or PhD in Computer Science (or equivalent proven track record).
• 4+ years working on large-scale ML codebases.
• Hands-on with PyTorch, JAX or TensorFlow; comfortable with distributed training (DeepSpeed / FSDP / SLURM / K8s).
• Experience in deep learning, NLP or LLMs; bonus for CUDA or data-pipeline chops.
• Strong software-design instincts: testing, code review, CI/CD.
• Self-starter, low-ego, collaborative.
 
Benefits
 
France
💰 Competitive cash salary and equity
🥕 Food: Daily lunch vouchers
🥎 Sport: Monthly contribution to a Gympass subscription
🚴 Transportation: Monthly contribution to a mobility pass
🧑‍⚕️ Health: Full health insurance for you and your family
🍼 Parental: Generous parental leave policy
🌎 Visa sponsorship
 
UK
💰 Competitive cash salary and equity
🚑 Insurance
🚴 Transportation: Reimbursement of office parking charges, or £90 per month for public transport
🥎 Sport: £90 per month reimbursement for gym membership
🥕 Meal voucher: £200 monthly allowance for meals
💰 Pension plan: SmartPension (contributions: 5% employee, 3% employer)
 
