← All jobs · Elicit

ML Research Resident

Elicit ·
87
AI-Agency
B88 U85
📍 Oakland, US 🌐 Remote/hybrid 💰 $144K–$180K 🛠 AI tools welcome at work Entry Contract
LLMsTransformers
TL;DR

ML Research Resident at Elicit developing computational operators for a research agent that iteratively improves knowledge states over thousands of steps. Focus on designing transparent, epistemically sound reasoning procedures for scientific document analysis and reasoning tasks.

Apply at Elicit →
share:
you'll be redirected to the company's career page

Job description

Elicit is building a research agent that can use an unlimited amount of test-time compute while keeping its reasoning transparent and verifiable.

The residency

Transformers do a fixed amount of computation per token, and the quality of work degrades rapidly when they are applied iteratively. As research resident, you'll work with us for 3 months on developing computational procedures (operators) that can reliably improve a knowledge state over thousands of iterations.

What is a knowledge state? A knowledge state consists of structured information - for example, a scientific paper might be represented as a set of claims supported by evidence and connected through logical reasoning; this might be combined with scratchpads, evergreen “notes to self”, search trees, and other information.

What counts as improvement? Like scientists, we want LLMs to make genuine progress in understanding - separating inferences from raw evidence, finding connections between ideas, building clearer explanations, and identifying gaps in reasoning. But unlike typical ML systems that are often trained to do “whatever works”, we need improvements that are epistemically sound - each step should make the knowledge state more useful while remaining human-readable. An improvement might reorganize information to better answer a question, find an implicit assumption in an argument, or connect evidence across multiple sources.

As research resident, your work will focus on designing and testing improvement operators that maintain stability over 1000+ iterations while making genuine progress. You'll start with simple cases (e.g., shallow refactoring of scientific papers) and demonstrate reliable iteration before scaling to more complex reasoning tasks.

Developing systems that perform legible reasoning over long horizons addresses core challenges in AI transparency and scalable reasoning.

About you

Strong candidates will have experience with LLMs, good intuitions about what makes reasoning systematic and verifiable, and care about AI transparency.

The best applicants will additionally have a strong software engineering background and concrete examples of how they've applied this background to come up with novel abstractions that push the frontiers of automated reasoning.

Logistics

Apply at Elicit →

More open roles at Elicit

Elicit · 🔄 synced 4h ago
Machine Learning Engineer
📍 Oakland, US 🌐 Remote 🛠 AI tools welcome at work · Mid
Machine Learning Engineer at Elicit building AI-powered research and decision-making systems. Focus on combining language models with data integrations, evaluation systems, and product interfaces for scientific teams.
Pythonlanguage modelsRAGAPIsevaluation systems
79
AI-core
Elicit · 🔄 synced 4h ago
Senior ML Product Manager
📍 Oakland, US 🌐 Remote 🛠 AI tools welcome at work · Senior
Senior ML Product Manager at Elicit, an AI research assistant using language models for literature review and research tasks. Lead ML-based product projects end-to-end, define product vision, and drive user research to scale reasoning capabilities.
LLMslanguage models
77
AI-core
Elicit · 🔄 synced 4h ago
AI Engineer
📍 Oakland, US 🌐 Remote 🛠 AI tools welcome at work · Mid
AI Engineer at Elicit building backend systems for an AI research assistant. Focus on prompt management, LLM orchestration, and distributed systems powering literature review and reasoning tasks.
Node.jsPythonNext.jsTypeScriptKubernetesGitHub
76
AI-core
Elicit · 🔄 synced 4h ago
Evaluation Engineer
📍 Oakland, US 🌐 Remote 🛠 AI tools welcome at work · Mid
Evaluation Engineer at Elicit building auto-evaluation systems for an AI research platform. Focus on infrastructure speed, interfaces for ML engineers and product managers, and statistical rigor for pharma decision-making assessments.
PythonTypeScriptasyncioPostgreSQL
69
AI-fluent
Elicit · 🔄 synced 4h ago
Senior Software Engineer
📍 Oakland, US 🌐 Remote 💰 $185K–$305K 🛠 AI tools welcome at work · Senior
Senior Software Engineer at Elicit, an AI research assistant using language models for literature review and reasoning tasks. Build and ship features across a full-stack Node/Python/Next.js system serving thousands of paying users.
Node.jsPythonNext.jsTypeScriptTailwindKubernetes
69
AI-fluent