📍 Showing US-based and remote roles by default.
View global →
7,673
jobs match
Anthropic
⚡ AI-native
·
🔄 synced 1h ago
[Expression of Interest] Research Scientist / Engineer, Honesty
Research Scientist/Engineer at Anthropic focused on honesty and alignment. Develop techniques to minimize hallucinations, enhance truthfulness in language models, and create evaluation frameworks for model accuracy and calibration.
85
AI-core
Anthropic
⚡ AI-native
·
🔄 synced 1h ago
Software Engineer, Safeguards
Software engineer at Anthropic building safety and oversight mechanisms for AI systems. Focus on monitoring models, detecting abuse, and preventing misuse through detection systems and real-time defenses.
62
AI-fluent
Anthropic
⚡ AI-native
·
🔄 synced 1h ago
Software Engineer, Account Abuse
Software engineer at Anthropic building account abuse detection and enforcement systems. Focus on signal gathering, monitoring, and multi-layered defenses against bad actors using computing resources.
47
AI-touching
Anthropic
⚡ AI-native
·
🔄 synced 1h ago
Research Lead, Training Insights
Research Lead on Anthropic's Training Insights team, designing and leading novel evaluation methodologies to measure model capabilities across training and deployment. Hands-on leadership role spanning model development lifecycle, with cross-organizational impact on safety and capability measurement.
85
AI-core
Anthropic
⚡ AI-native
·
🔄 synced 1h ago
Research Engineer, Machine Learning (Reinforcement Learning)
Research Engineer in Reinforcement Learning at Anthropic, building scalable RL infrastructure and training methodologies for large language models. Focus on agentic systems, tool use, code generation, and reasoning capabilities.
81
AI-core
Anthropic
⚡ AI-native
·
🔄 synced 1h ago
Policy Manager, Chemical Weapons and High Yield Explosives
Policy Manager at Anthropic designing evaluation methodologies and safety strategies for AI systems handling chemical weapons and explosives information. Requires Ph.D. in Chemistry or Chemical Engineering with 5-8+ years in C/E defense and expertise in energetic materials.
59
AI-fluent
Anthropic
⚡ AI-native
·
🔄 synced 1h ago
Offensive Security Research Engineer, Safeguards
Offensive Security Research Engineer at Anthropic focused on safeguarding LLMs from adversarial misuse. Role involves vulnerability research, exploitation scaffolds, and defensive strategy development to mitigate risks from AI-enabled attacks.
76
AI-core
Anthropic
⚡ AI-native
·
🔄 synced 1h ago
Developer Education Lead
Developer Education Lead at Anthropic, owning end-to-end strategy and execution for developer-facing education content across video, tutorials, and courses. Role sits at intersection of DevRel, product, and education, working cross-functionally on launches and content roadmap prioritization.
52
AI-fluent
Anthropic
⚡ AI-native
·
🔄 synced 1h ago
Software Engineer, Human Data Interface
Software engineer at Anthropic building data collection pipelines and interfaces for model training. Focus on full-stack infrastructure, crowdworker experience, and supporting research teams' evolving data needs.
59
AI-fluent
Anthropic
⚡ AI-native
·
🔄 synced 1h ago
Safeguards Analyst, Account Abuse
Safeguards Analyst at Anthropic building account abuse detection and enforcement systems. Focus on developing signals, integrating third-party data, and operationalizing enforcement workflows to protect the platform at scale.
49
AI-touching
Anthropic
⚡ AI-native
·
🔄 synced 1h ago
Engineering Manager, Safeguards Data Infrastructure
Engineering Manager at Anthropic leading the Safeguards Data Infrastructure team. Responsible for building privacy-preserving data infrastructure, managing multi-cloud portability, and ensuring compliance with regulations like HIPAA across deployment environments.
59
AI-fluent
Anthropic
⚡ AI-native
·
🔄 synced 1h ago
Threat Collections Engineer
Threat Collections Engineer at Anthropic building infrastructure for threat discovery and abuse detection. Develop automated detection systems, YARA rule infrastructure, and data pipelines integrating external threat intelligence sources.
52
AI-fluent
Anthropic
⚡ AI-native
·
🔄 synced 1h ago
Software Engineer, Safeguards Infrastructure
Software engineer at Anthropic building safeguards infrastructure for AI systems. Focus on detection systems, real-time safety mechanisms, and operational tooling to prevent misuse and ensure user well-being.
69
AI-fluent
Anthropic
⚡ AI-native
·
🔄 synced 1h ago
Research Scientist, Frontier Red Team (Emerging Risks)
Research Scientist at Anthropic's Frontier Red Team focusing on emerging societal risks from advanced AI systems. Design experiments, build evaluations, and develop defenses against risks from autonomous AI integration into business and infrastructure.
83
AI-core
Anthropic
⚡ AI-native
·
🔄 synced 1h ago
Design Engineer, AI Capability Development (Education Labs)
Design Engineer at Anthropic's Education Labs building product features that help users develop real skill with AI. Full-stack role combining research insights, interaction design, and production engineering to ship capability-focused experiences.
75
AI-core
Anthropic
⚡ AI-native
·
🔄 synced 1h ago
Technical CBRN-E Threat Investigator
Technical CBRN-E Threat Investigator at Anthropic detecting and investigating misuse of AI systems for chemical, biological, radiological, nuclear, and explosives threats. Role combines AI safety with biosecurity expertise, conducting threat analysis and developing detection capabilities.
49
AI-touching
Anthropic
⚡ AI-native
·
🔄 synced 1h ago
Research Scientist, Societal Impacts
Research Scientist at Anthropic analyzing Claude's real-world usage patterns and building evaluations to assess model safety and alignment. Role involves cross-functional collaboration with fine-tuning, safeguards, and policy teams to translate research insights into model improvements.
83
AI-core
Anthropic
⚡ AI-native
·
🔄 synced 1h ago
Research Engineer / Research Scientist, Vision
Research Engineer at Anthropic building vision and spatial reasoning capabilities for Claude. Focus on multimodal model development, evaluation, and agentic infrastructure across pretraining, RL, and runtime techniques.
91
AI-core
Anthropic
⚡ AI-native
·
🔄 synced 1h ago
Research Engineer, Frontier Red Team (Autonomy)
Research Engineer at Anthropic building autonomous AI systems and defensive agents to understand and counter adversarial AI. Focus on agent design, evals, robotics integration, and policy-relevant demonstrations.
93
AI-core
Anthropic
⚡ AI-native
·
🔄 synced 1h ago
Technical Policy Manager, Cyber Harms
Technical Policy Manager, Cyber Harms at Anthropic leading a team to design safety systems that detect harmful cyber behaviors and prevent AI misuse. Combines deep cybersecurity expertise with policy development to shape responsible AI safety in the cybersecurity domain.
69
AI-fluent
Anthropic
⚡ AI-native
·
🔄 synced 1h ago
Technical Cyber Threat Investigator
Technical Cyber Threat Investigator at Anthropic detecting and investigating misuse of Claude for malicious cyber operations. Role combines AI safety and cybersecurity, developing detection techniques and building defenses against AI-enabled threats.
43
AI-touching
Anthropic
⚡ AI-native
·
🔄 synced 1h ago
Research Engineer, Universes
Research Engineer at Anthropic building training environments for agentic AI systems. Role blends research and engineering, focusing on reinforcement learning, environment design, and capability evaluation for safe AI.
91
AI-core
Anthropic
⚡ AI-native
·
🔄 synced 1h ago
Research Engineer, Cybersecurity Reinforcement Learning
Research Engineer at Anthropic building reinforcement learning systems for secure coding and cybersecurity. Role blends research and engineering, requiring domain expertise in cybersecurity and machine learning to develop RL environments and advance model capabilities in defensive security.
81
AI-core
Anthropic
⚡ AI-native
·
🔄 synced 1h ago
Biological Safety Research Scientist
Biological Safety Research Scientist at Anthropic designing safety systems to detect harmful AI behaviors and prevent misuse of biological knowledge. Role involves capability evaluations, threat modeling, and embedding biosecurity safeguards throughout model development.
71
AI-fluent
Anthropic
⚡ AI-native
·
🔄 synced 1h ago
Senior Research Scientist, Reward Models
Senior Research Scientist at Anthropic leading research on reward models and RLHF for large language models. Focus on novel architectures, LLM-based evaluation methods, and techniques to mitigate reward hacking.
89
AI-core