← All jobs · Stripe

Engineering Manager, HADR

Stripe ·
53
AI-Agency
B55 U50
📍 US 🌐 Remote-only Manager 4+ yrs
MongoDBdistributed systemscloud infrastructure
TL;DR

Engineering Manager at Stripe leading the High Availability and Disaster Recovery team. Manages engineers building distributed systems for latency-critical, stateful applications across multiple regions.

Apply at Stripe →
you'll be redirected to the company's career page

Job description

<p><strong>Who we are</strong></p> <h3><strong>About Stripe</strong></h3> <p>Stripe is a financial infrastructure platform for businesses. Millions of companies—from the world’s largest enterprises to the most ambitious startups—use Stripe to accept payments, grow their revenue, and accelerate new business opportunities. Our mission is to increase the GDP of the internet, and we have a staggering amount of work ahead. That means you have an unprecedented opportunity to put the global economy within everyone’s reach while doing the most important work of your career.</p> <h3><strong>About the team</strong></h3> <p>In this role, you will be joining the High Availability and Disaster Recovery team. At Stripe, availability is a core feature of our products. This team designs and builds new solutions to allow latency-critical, stateful applications to survive any type of disaster. We build distributed systems on top of unreliable architecture to provide highly available and resilient customer solutions. This team is creating greenfield solutions which will serve as the basis for Stripe’s architecture 5, 10, or 20 years into the future.</p> <p>This is a distributed team with many remote engineers. You are encouraged to apply if you meet the minimum requirements and are able to work from anywhere in the United States or Canada.</p> <h2><strong>What you’ll do</strong></h2> <p>You will help develop our global architecture by combining less-available components and data centers into a highly available and resilient whole. You will work on latency-critical solutions where every millisecond matters and data redundancy is a hard requirement. You will learn quickly and work on a broad range of problems - one day may be investigating Mongo write concerns, the next may be minimizing cross-region TLS handshakes, followed by developing new systems to automate disaster detection and failovers. Your work will enable Stripe to increase the GDP of the internet by providing uptime and data protection which have historically been impossible.</p> <h3><strong>Responsibilities</strong></h3> <ul> <li>Lead and manage a team of talented engineers on the team, providing mentorship, guidance, and support to ensure their success.</li> <li>Drive the execution of projects, overseeing the entire development lifecycle from planning to delivery, while maintaining high standards of quality and timely completion.</li> <li>Help influence peers / managers and build consensus while dealing with ambiguity</li> <li>Build your team - formalizing role definitions, defining charter and ownership boundaries and taking a newly formed team into a high-functioning one&nbsp;</li> </ul> <h2><strong>Who you are</strong><strong><em>&nbsp;</em></strong></h2> <p>We’re looking for someone who meets the minimum requirements to be considered for the role. If you meet these requirements, you are encouraged to apply. The preferred qualifications are a bonus, not a requirement.</p> <h3><strong>Minimum requirements</strong></h3> <p>This is where you’ll include the minimum requirements for the job. These are the absolute minimum experiences and skills needed to be considered for the position. Any candidate, whether outbound, inbound, or referred, who does not meet these, will not be considered so be fastidious when listing these.</p> <ul> <li>4+ years of software development experience&nbsp;</li> <li>2+ years of cloud development or management experience</li> <li>Professional working proficiency in English</li> </ul> <h3><strong>Preferred qualifications</strong></h3> <ul> <li>Understanding of distributed system concepts (ex. leader election, voting, quorum)</li> <li>Background in high-availability systems, chaos engineering, or disaster recovery design</li> <li>Experience with cloud infrastructure and multi-region deployments</li> <li>Familiarity with document databases such as MongoDB</li> </ul>
Apply at Stripe →

More open roles at Stripe

Stripe ·
Forward Deployed AI Accelerator, Marketing
📍 US 🌐 Remote 🛠 AI tools welcome · Mid
Forward Deployed AI Accelerator at Stripe's Marketing organization. Embed with marketing teams to build custom AI agents, automations, and tools that transform workflows. Coach marketers toward AI-first operations and scale successful patterns across the organization.
ClaudeClaude CodePythonAPI integrations
83
AI-core
Stripe ·
Forward Deployed AI Accelerator, Marketing
📍 US 🌐 Remote 🛠 AI tools welcome · Mid
Forward Deployed AI Accelerator at Stripe's Marketing organization. Embed with marketing teams to build custom AI agents, automations, and tools that transform workflows. Coach marketers toward AI-first operations and scale successful patterns across the organization.
ClaudeClaude CodePythonAPI integrations
83
AI-core
Stripe ·
Forward Deployed AI Accelerator, Marketing
📍 Singapore, SG 🛠 AI tools welcome · Mid
Forward Deployed AI Accelerator at Stripe embedded with marketing teams to build custom AI tools, agents, and automations. Coach marketers through AI adoption and systematically scale workflow transformations across the organization.
ClaudeClaude CodeAPI integrationsworkflow automation tools
81
AI-core
Stripe ·
Machine Learning Engineer, Stripe Assistant
📍 US 🌐 Remote 🛠 AI tools welcome · Senior
Senior Machine Learning Engineer at Stripe building the Stripe Assistant, an LLM-powered agent platform for merchant support and automation. Focus on agentic architecture, RAG, fine-tuning, and safe execution of high-trust actions at scale.
PythonLLMsRAGembeddingsRLHFagentic systems
81
AI-core
Stripe ·
Treasury Finance AI and Quantitative Analytics
📍 New York, US 🛠 AI tools welcome · Mid
Treasury Finance AI and Quantitative Analytics role at Stripe building autonomous agents and AI-powered tools for treasury workflows. Combines finance domain expertise with LLM frameworks (LangChain, LangGraph) and Python to automate liquidity management and risk analysis.
PythonPyTorchTensorFlowLangChainLangGraphStreamlit
79
AI-core