
AI Engineer, Product

Mistral AI
📍 Paris, FR · 🌐 Remote/hybrid · 🛠 AI tools welcome at work · Mid · 3–4+ yrs
TypeScript · Python · LLM · A/B testing · observability
TL;DR

AI Engineer at Mistral AI improving AI-powered product features through evaluation design, prompt engineering, and A/B testing. Owns AI quality end-to-end for search, chat, documents, or audio domains.

Apply at Mistral AI →

Job description

About Mistral 
 
At Mistral AI, we believe in the power of AI to simplify tasks, save time, and enhance learning and creativity. Our technology is designed to integrate seamlessly into daily working life.
 
We democratize AI through high-performance, optimized, open-source and cutting-edge models, products and solutions. Our comprehensive AI platform is designed to meet enterprise needs, whether on-premises or in cloud environments. Our offerings include le Chat, the AI assistant for life and work.
 
We are a dynamic, collaborative team passionate about AI and its potential to transform society. Our diverse workforce thrives in competitive environments and is committed to driving innovation. Our teams are distributed across France, the USA, the UK, Germany, and Singapore. We are creative, low-ego and team-spirited.
 
Join us to be part of a pioneering company shaping the future of AI. Together, we can make a meaningful impact. See more about our culture on https://mistral.ai/careers.
 
Role summary
 
Embedded directly in a product team such as search, chat, documents, or audio, you'll improve AI-powered features through rigorous evaluation, prompt and orchestration design, and rapid experimentation. You'll own your domain's AI quality end-to-end: define what "good" looks like, measure it, run experiments, and ship what works. You'll work with Science to deliver measurable improvements to quality, latency, safety, and reliability.
 
 
What you will do
 
• Design and run evaluations for your product area: reference tests, heuristics, and model-graded checks tailored to search relevance, chat quality, document understanding, or audio performance.
• Define and track metrics that matter: task success, helpfulness, hallucination proxies, safety flags, latency, cost.
• Own prompt and orchestration design: write, test, and iterate on prompts and system prompts as a core part of your work.
• Run A/B tests on prompts, models, and configurations; analyze results; make rollout or rollback decisions from data.
• Set up observability for LLM calls: structured logging, tracing, dashboards, alerts.
• Operate model releases: canary and shadow traffic, sign-offs, SLO-based rollback criteria, regression detection.
• Improve core behaviors in your product area, whether that's memory policies, intent classification, routing, tool-call reliability, or retrieval quality.
• Create templates and documentation so other teams can author evals and ship safely.
• Partner with Science to diagnose regressions and lead post-mortems.
 
About you
 
 
• 3-4 years of experience; backgrounds that fit well include ML engineers moving closer to product, or software engineers with real AI/ML production experience.
• Strong TypeScript or Python skills - we have both tracks depending on team fit.
• Production LLM experience: prompts, tool/function calling, system prompts.
• Hands-on with evals and A/B testing; you can design metrics, not just run them.
• Comfortable implementing directly in product code, not only notebooks.
• Observability experience: logging, tracing, dashboards, alerting.
• Product mindset: form hypotheses, run experiments, interpret results, ship.
• Clear communicator, autonomous, and oriented toward production impact over experimentation for its own sake.
 
It would be ideal if you also have:
• Safety systems experience: moderation, PII handling/redaction, guardrails.
• Release operations: canary/shadowing, automated rollbacks, experiment platforms.
• Prior work on search ranking, chat systems, document AI, or audio ML features.
 
Hiring Process
 
• Introduction call - 30 min
• Hiring Manager interview - 30 min
• Technical Rounds
  - Live-coding Interview - 45 min
  - AI Engineering Interview - 45 min
• Culture-fit discussion - 30 min
• References
 
By applying, you agree to our Applicant Privacy Policy.

Location & Remote
 
The position is based in our Paris HQ offices, and we encourage coming to the office as much as possible (at least 3 days per week) to build bonds and keep communication smooth. Our remote policy aims to provide flexibility, improve work-life balance and increase productivity. Each manager can decide the number of days worked remotely based on autonomy and the specific context (e.g. more flexibility may be possible during summer). In any case, employees are expected to maintain regular communication with their teams and be available during core working hours.
 
What we offer
 
💰 Competitive salary and equity package
🧑‍⚕️ Health insurance
🚴 Transportation allowance
🥎 Sport allowance
🥕 Meal vouchers
💰 Private pension plan
🍼 Generous parental leave policy
 
