1,180 jobs match
Anthropic
⚡ AI-native
·
🔄 synced 6h ago
[Expression of Interest] Research Scientist / Engineer, Honesty
Research Scientist/Engineer at Anthropic focused on honesty and alignment. Develop techniques to minimize hallucinations, enhance truthfulness in language models, and create evaluation frameworks for model accuracy and calibration.
85
AI-core
Anthropic
⚡ AI-native
Software Engineer, Safeguards
Software engineer at Anthropic building safety and oversight mechanisms for AI systems. Focus on monitoring models, detecting abuse, and preventing misuse through detection systems and real-time defenses.
62
AI-fluent
Anthropic
⚡ AI-native
Research Lead, Training Insights
Research Lead on Anthropic's Training Insights team, designing and leading novel evaluation methodologies to measure model capabilities across training and deployment. Hands-on leadership role spanning the model development lifecycle, with cross-organizational impact on safety and capability measurement.
85
AI-core
Anthropic
⚡ AI-native
Research Engineer, Machine Learning (Reinforcement Learning)
Research Engineer in Reinforcement Learning at Anthropic, building scalable RL infrastructure and training methodologies for large language models. Focus on agentic systems, tool use, code generation, and reasoning capabilities.
81
AI-core
Anthropic
⚡ AI-native
Policy Manager, Chemical Weapons and High Yield Explosives
Policy Manager at Anthropic designing evaluation methodologies and safety strategies for AI systems handling chemical weapons and explosives information. Requires a Ph.D. in Chemistry or Chemical Engineering with 5-8+ years in chemical/explosives (C/E) defense and expertise in energetic materials.
59
AI-fluent
Anthropic
⚡ AI-native
Offensive Security Research Engineer, Safeguards
Offensive Security Research Engineer at Anthropic focused on safeguarding LLMs from adversarial misuse. Role involves vulnerability research, exploitation scaffolds, and defensive strategy development to mitigate risks from AI-enabled attacks.
76
AI-core
Anthropic
⚡ AI-native
Developer Education Lead
Developer Education Lead at Anthropic, owning end-to-end strategy and execution for developer-facing education content across video, tutorials, and courses. Role sits at the intersection of DevRel, product, and education, working cross-functionally on launches and content roadmap prioritization.
52
AI-fluent
Anthropic
⚡ AI-native
Software Engineer, Human Data Interface
Software engineer at Anthropic building data collection pipelines and interfaces for model training. Focus on full-stack infrastructure, crowdworker experience, and supporting research teams' evolving data needs.
59
AI-fluent
Anthropic
⚡ AI-native
Engineering Manager, Safeguards Data Infrastructure
Engineering Manager at Anthropic leading the Safeguards Data Infrastructure team. Responsible for building privacy-preserving data infrastructure, managing multi-cloud portability, and ensuring compliance with regulations like HIPAA across deployment environments.
59
AI-fluent
Anthropic
⚡ AI-native
Threat Collections Engineer
Threat Collections Engineer at Anthropic building infrastructure for threat discovery and abuse detection. Develop automated detection systems, YARA rule infrastructure, and data pipelines integrating external threat intelligence sources.
52
AI-fluent
Anthropic
⚡ AI-native
Software Engineer, Safeguards Infrastructure
Software engineer at Anthropic building safeguards infrastructure for AI systems. Focus on detection systems, real-time safety mechanisms, and operational tooling to prevent misuse and ensure user well-being.
69
AI-fluent
Anthropic
⚡ AI-native
Research Scientist, Frontier Red Team (Emerging Risks)
Research Scientist at Anthropic's Frontier Red Team focusing on emerging societal risks from advanced AI systems. Design experiments, build evaluations, and develop defenses against risks from autonomous AI integration into business and infrastructure.
83
AI-core
Anthropic
⚡ AI-native
Research Engineer / Scientist, Societal Impacts
Research Engineer/Scientist at Anthropic building infrastructure for studying societal impacts of AI systems. Focus on experimental design, data pipelines, evaluation tools, and cross-functional collaboration with researchers and policy teams.
69
AI-fluent
Anthropic
⚡ AI-native
Research Engineer / Research Scientist, Vision
Research Engineer at Anthropic building vision and spatial reasoning capabilities for Claude. Focus on multimodal model development, evaluation, and agentic infrastructure across pretraining, RL, and runtime techniques.
91
AI-core
Anthropic
⚡ AI-native
Research Engineer, Frontier Red Team (Autonomy)
Research Engineer at Anthropic building autonomous AI systems and defensive agents to understand and counter adversarial AI. Focus on agent design, evals, robotics integration, and policy-relevant demonstrations.
93
AI-core
Anthropic
⚡ AI-native
Technical Policy Manager, Cyber Harms
Technical Policy Manager, Cyber Harms at Anthropic leading a team to design safety systems that detect harmful cyber behaviors and prevent AI misuse. Combines deep cybersecurity expertise with policy development to shape responsible AI safety in the cybersecurity domain.
69
AI-fluent
Anthropic
⚡ AI-native
Research Engineer, Universes
Research Engineer at Anthropic building training environments for agentic AI systems. Role blends research and engineering, focusing on reinforcement learning, environment design, and capability evaluation for safe AI.
91
AI-core
Anthropic
⚡ AI-native
Research Engineer, Cybersecurity Reinforcement Learning
Research Engineer at Anthropic building reinforcement learning systems for secure coding and cybersecurity. Role blends research and engineering, requiring domain expertise in cybersecurity and machine learning to develop RL environments and advance model capabilities in defensive security.
81
AI-core
Anthropic
⚡ AI-native
Biological Safety Research Scientist
Biological Safety Research Scientist at Anthropic designing safety systems to detect harmful AI behaviors and prevent misuse of biological knowledge. Role involves capability evaluations, threat modeling, and embedding biosecurity safeguards throughout model development.
71
AI-fluent
Anthropic
⚡ AI-native
Senior Research Scientist, Reward Models
Senior Research Scientist at Anthropic leading research on reward models and RLHF for large language models. Focus on novel architectures, LLM-based evaluation methods, and techniques to mitigate reward hacking.
89
AI-core
Anthropic
⚡ AI-native
Research Scientist, Interpretability
Research Scientist in Interpretability at Anthropic, focused on mechanistic understanding of large language models. Develop methods to reverse-engineer neural network algorithms, design experiments at scale, and build infrastructure for interpretability research.
93
AI-core
Anthropic
⚡ AI-native
Research Engineer, Reward Models Platform
Research Engineer at Anthropic building infrastructure for reward model development. Focus on automating researcher workflows, enabling rapid iteration on reward signals, and scaling reward development across domains.
79
AI-core
Anthropic
⚡ AI-native
Research Engineer, Interpretability
Research Engineer, Interpretability at Anthropic building specialized inference and training infrastructure for mechanistic interpretability research. Focus on scaling bottlenecks, activation extraction tooling, and supporting safety audits on frontier models.
81
AI-core
Anthropic
⚡ AI-native
Staff Research Engineer, Discovery Team
Staff Research Engineer at Anthropic working on the Discovery Team to build AI systems capable of scientific reasoning and computer use. Role spans model training, evaluation, inference optimization, and distributed systems to advance toward scientific AGI.
87
AI-core
Anthropic
⚡ AI-native
Research Engineer, Production Model Post-Training
Research Engineer at Anthropic building post-training pipelines for production Claude models. Focus on implementing Constitutional AI, RLHF, and alignment techniques at scale on frontier models.
93
AI-core