
Senior Software Engineer, ML Infrastructure

SambaNova
📍 US · 🌐 Remote-only · 💰 $200K–$275K · Senior · 5+ yrs
Python · vLLM · TensorRT-LLM · SGLang · Kubernetes
TL;DR

Senior Software Engineer at SambaNova building production-grade LLM inference infrastructure on custom RDU hardware. Focus on request scheduling, decoding algorithms, caching, and serving stack optimization.

Apply at SambaNova →

Job description

The era of pervasive AI has arrived. In this era, organizations will use generative AI to unlock hidden value in their data, accelerate processes, reduce costs, and drive efficiency and innovation, fundamentally transforming their businesses and operations at scale.

SambaNova Suite™ is the first full-stack, generative AI platform, from chip to model, optimized for enterprise and government organizations. Powered by the intelligent SN40L chip, the SambaNova Suite is a fully integrated platform, delivered on-premises or in the cloud, combined with state-of-the-art open-source models that can be easily and securely fine-tuned using customer data for greater accuracy. Once adapted with customer data, customers retain model ownership in perpetuity, so they can turn generative AI into one of their most valuable assets.

Overview

The Senior Software Engineer, ML Infrastructure will be responsible for designing, building, and operating the production-grade inference infrastructure that powers SambaNova's serving stack on our Reconfigurable Dataflow Unit (RDU) architecture. SambaNova is an inference-first company, and this role sits at the heart of that mission: turning state-of-the-art inference techniques into reliable, high-throughput, low-latency services exposed to customers through SambaStack and SambaCloud. The engineer will own end-to-end systems spanning request scheduling, advanced decoding algorithms, caching layers, API surfaces, and the accuracy infrastructure that keeps the stack trustworthy. This role partners closely with ML, compiler, runtime, and product teams to ship inference features from prototype to production.

Qualifications

Key responsibilities

 

Base Salary Range:

$200,000–$275,000 USD

Submission Guidelines
Please note that in order to be considered an applicant for any position at SambaNova Systems, you must submit an application form for each position for which you believe you are qualified. 

EEO Policy
SambaNova Systems is an Equal Opportunity/Affirmative Action Employer. All qualified applicants will receive consideration for employment without regard to age (40 and over), color, disability, gender identity, genetic information, marital status, military or veteran status, national origin/ancestry, race, religion, creed, sex (including pregnancy, childbirth, breastfeeding), sexual orientation, or any other applicable status protected by federal, state, or local laws.

Benefits Summary for US-Based, Full-Time Employment Positions
SambaNova offers a competitive total rewards package, including base salary, equity, and benefits. We cover 95% of the premium for employee medical insurance and 77% of the premium for dependents, and we offer a Health Savings Account (HSA) with an employer contribution. We also offer Dental, Vision, Short- and Long-Term Disability, Basic Life, Voluntary Life, and AD&D insurance plans, in addition to Flexible Spending Account (FSA) options such as Health Care, Limited Purpose, and Dependent Care. Our library of well-being benefits, available to you and your dependents, includes a full subscription to Headspace, a Gympass+ membership with access to physical gyms, a One Medical membership, counseling services through an Employee Assistance Program, and much more.

