
Senior Product Operations Manager, Evaluation

Harvey ·
📍 San Francisco, US 💰 $178K–$210K Senior 4–7+ yrs
SQL, Python
TL;DR

Senior Product Operations Manager at Harvey building evaluation infrastructure for legal AI. Responsible for scaling model evaluation systems, embedding evaluation workflows into product development, and managing human data vendors for a global agentic AI platform.


Job description

Why Harvey

At Harvey, we’re transforming how legal and professional services operate — not incrementally, but end-to-end. By combining frontier agentic AI, an enterprise-grade platform, and deep domain expertise, we’re reshaping how critical knowledge work gets done for decades to come.

This is a rare chance to help build a generational company at a true inflection point. With 1000+ customers in 60+ countries, strong product-market fit, and world-class investor support, we’re scaling fast and defining a new category in real time. The work is ambitious, the bar is high, and the opportunity for growth — personal, professional, and financial — is unmatched.

Our team is sharp, motivated, and deeply committed to the mission. We move fast, operate with intensity, and take real ownership of the problems we tackle — from early thinking to long-term outcomes. We stay close to our customers — from leadership to engineers — and work together to solve real problems with urgency and care. If you thrive in ambiguity, push for excellence, and want to help shape the future of work alongside others who raise the bar, we invite you to build with us.

At Harvey, the future of professional services is being written today — and we’re just getting started.

Role Overview

We’re looking for a technical, systems-minded operator to build and scale the evaluation engine behind Harvey’s platform. As we expand globally, ensuring our models behave reliably, accurately, and correctly for each jurisdiction is mission-critical, and evaluation complexity is increasing 10x.

As a member of our Product Operations team, you’ll work closely with Applied Legal Researchers, Product, Engineering, AI Research, and human data providers to operationalize evaluation methodologies and embed them into our product development lifecycle. You’ll create the workflows, systems, and tooling that make evaluation a first-class product capability at Harvey.

This is a high-ownership role for someone who thrives in ambiguity, loves building structure from it, and wants to help scale the evaluation infrastructure of a global AI company.

What You’ll Do

What You Have

Bonus Points

Compensation

$178,000 - $210,000 USD

Depending on your location, an Applicant Privacy Notice may apply to you. You can find all of our Applicant Privacy Notices [here].


Harvey is an equal opportunity employer and does not discriminate on the basis of race, gender, sexual orientation, gender identity/expression, national origin, disability, age, genetic information, veteran status, marital status, pregnancy or related condition, or any other basis protected by law.

We are committed to providing reasonable accommodations to applicants with disabilities. Requests can be made by emailing accommodations@harvey.ai.

