← All jobs · Mistral AI

Model Behavior Architect- Safety

Mistral AI ·
76
AI-Agency
B82 U65
📍 Paris, FR Senior
LLM evaluationsynthetic testingevaluation pipelines
TL;DR

Model Behavior Architect at Mistral AI defining and measuring LLM behavior. Role involves designing evaluation pipelines, writing policies, and identifying model improvements across reasoning, audio, alignment, and tool use.

Apply at Mistral AI →
share:
you'll be redirected to the company's career page

Job description

About Mistral 
 
At Mistral AI, we believe in the power of AI to simplify tasks, save time, and enhance learning and creativity. Our technology is designed to integrate seamlessly into daily working life.
 
We democratize AI through high-performance, optimized, open-source and cutting-edge models, products and solutions. Our comprehensive AI platform is designed to meet enterprise as well as personal needs. Our offerings include Le Chat, La Plateforme, Mistral Code and Mistral Compute - a suite that brings frontier intelligence to end-users.
 
We are a dynamic, collaborative team passionate about AI and its potential to transform society. Our diverse workforce thrives in competitive environments and is committed to driving innovation. Our teams are distributed between France, USA, UK, Germany and Singapore. We are creative, low-ego and team-spirited.
 
Join us to be part of a pioneering company shaping the future of AI. Together, we can make a meaningful impact. See more about our culture on https://mistral.ai/careers.
 
Mistral AI participates in the E-Verify program
 

By applying, you agree to our Applicant Privacy Policy.


About the role
As a Model Behavior Architect, you are at the forefront of defining and measuring LLM behaviour.
 
We are looking for people who have built a career in engineering, machine learning, and large language models and are experts in model evaluation, policy writing, and creating eval pipelines for complicated tasks. Your role is to work hand-in-hand with our Science team to define what ‘good’ looks like for Reasoning, Audio, Alignment, Tools, and all Frontier bets.
 
Join us if you are passionate about tackling cutting-edge, open-ended research challenges and transforming your insights into best-in-class models.

What you will do

  • Interact with models to identify where model behavior can be improved
  • Gather internal and external feedback on model behavior to scope areas for improvement
  • Design and implement evals, data guidelines, data generation, and synthetic testing environments
  • Identify and fix edge case behaviors through rigorous testing
  • Develop robust evaluation pipelines for our model candidates
  • Work collaboratively with AI Scientists
  • About you

  • You have a deep understanding of either 1) linguistics, language, and translation, 2) engineering and code behavior, 3) LLM agents at work, including reasoning and tool use
  • You have prior knowledge in training and optimising model behaviour
  • You are an expert at building robust evaluations
  • You thrive in dynamic and technically complex environments
  • You have a track record of delivering innovative, out-of-the-box solutions to address real-world constraints
  •  
    By applying, you agree to our Applicant Privacy Policy.
    Apply at Mistral AI →

    More open roles at Mistral AI

    Mistral AI ⚡ AI-native · 🔄 synced 8h ago
    Applied Scientist / Research Engineer - Singapore
    📍 Singapore, SG 🛠 AI tools welcome at work · Senior
    Applied Scientist / Research Engineer at Mistral AI building state-of-the-art models across text, image, and speech modalities. Focus on pre-training, post-training, data curation, and deploying models on large GPU clusters.
    PyTorchJAXPython
    91
    AI-core
    Mistral AI ⚡ AI-native · 🔄 synced 8h ago
    Senior/Staff Applied Scientist/Research Engineer - EMEA
    📍 Paris, FR 🛂 Visa sponsor 🛠 AI tools welcome at work · Senior
    Senior/Staff Applied Scientist or Research Engineer at Mistral AI developing state-of-the-art models across text, image, and speech modalities. Responsibilities include pre-training, post-training, model deployment on large GPU clusters, data curation, and cross-functional collaboration on AI solutions.
    PyTorchJAXPython
    89
    AI-core
    Mistral AI ⚡ AI-native · 🔄 synced 8h ago
    Discovery Scientists - AI for Science - Paris
    📍 Paris, FR 🛂 Visa sponsor 🛠 AI tools welcome at work · Senior
    Discovery Scientist at Mistral AI building AI solutions for physics, chemistry, and materials science. Lead independent research projects leveraging deep learning and LLMs to accelerate scientific discovery and simulations.
    PyTorchTensorFlowPythondeep learningLLMs
    87
    AI-core
    Mistral AI ⚡ AI-native · 🔄 synced 8h ago
    Applied AI, Technical Lead, Forward Deployed AI Engineer - Munich
    📍 Munich, DE · Lead
    Technical Lead, Applied AI at Mistral AI leading teams to deploy complex AI solutions for enterprise customers. Hands-on role combining IC coding with team mentorship, pre-sales technical guidance, and RAG/agentic system design.
    PythonPyTorchLangChainHugging FaceAWSGCP
    85
    AI-core
    Mistral AI ⚡ AI-native · 🔄 synced 8h ago
    Applied AI, Technical Lead, Forward Deployed AI Engineer - Abu Dhabi
    📍 Abu Dhabi, AE 🛠 AI tools welcome at work · Lead
    Technical Lead, Applied AI at Mistral AI leading teams to deploy enterprise AI solutions. Hands-on role combining IC coding with team mentorship, pre-sales technical guidance, and RAG/agentic system design.
    PythonPyTorchLangChainHugging FaceAWSGCP
    85
    AI-core