Baseten jobs: 53 open roles

Baseten · 42d ago 🔄 synced 38m ago

Software Engineer - Voice AI (Inference Runtime)

📍 San Francisco, US 🛠 AI tools welcome at work · Senior

Software engineer at Baseten building the Voice AI inference runtime stack. Focus on model serving, real-time latency optimization, and large-scale infrastructure for STT, TTS, and voice agent workloads.

PythonvLLMTensorRTONNXDockerKubernetes

83

AI-core

Baseten · 79d ago 🔄 synced 38m ago

Post-Training Research Scientist

📍 San Francisco, US · Senior

Post-Training Research Scientist at Baseten, an AI inference platform. Pursue foundational and applied research in post-training methodology and inference optimization, with direct collaboration on production systems serving major AI companies.

PyTorchJAXTensorFlowTransformersvLLM

83

AI-core

Baseten · 100d ago 🔄 synced 38m ago

Software Engineer - AI Enablement

📍 San Francisco, US 🛠 AI tools welcome at work · Mid

Software engineer at Baseten building internal AI agents and LLM-powered workflows to boost engineering productivity. Focus on evaluating, deploying, and customizing AI coding assistants and custom agents for the engineering org.

PythonLLM agentsClaude CodeCursorCodex

80

AI-core

Baseten · 27d ago 🔄 synced 38m ago

Engineering Manager - Forward Deployed Engineering (LLM)

📍 San Francisco, US 🛠 AI tools welcome at work · Manager

Engineering Manager at Baseten leading a Forward Deployed Engineering team building and optimizing LLM inference systems for production customers. Hands-on player-coach role combining team leadership with direct technical contribution to customer deployments and core platform features.

PythonvLLMTensorRTTritonHugging FaceRay Serve

79

AI-core

Baseten · 44d ago 🔄 synced 38m ago

Applied AI Inference Engineer

📍 San Francisco, US 🛠 AI tools welcome at work · Mid

Applied AI Inference Engineer at Baseten building production AI systems for customers. Role combines hands-on Python development, customer-facing implementation, and product engineering to deploy high-scale inference applications on Baseten's platform.

PythonDockerML model deploymentinference optimization

77

AI-core

Baseten · 44d ago 🔄 synced 38m ago

AI Solutions Engineer

📍 San Francisco, US 🛠 AI tools welcome at work · Mid

AI Solutions Engineer at Baseten, an inference platform for AI companies. Role combines hands-on Python development with customer-facing implementation, deploying production AI systems end-to-end from design through monitoring.

PythonDockerComfyUIWhisper

77

AI-core

Baseten · 798d ago 🔄 synced 38m ago

Forward Deployed Engineer

📍 San Francisco, US 🛠 AI tools welcome at work · Mid

Forward Deployed Engineer at Baseten, an AI inference platform. You'll architect and deploy production AI applications for customers like Cursor and Notion, combining hands-on coding with customer-facing implementation and product work.

PythonDockerML model deploymentAPI development

77

AI-core

Baseten · 133d ago 🔄 synced 38m ago

Software Engineer - Training Product

📍 San Francisco, US 🛠 AI tools welcome at work · Senior

Software engineer at Baseten building training infrastructure for AI companies. Own features like multi-node training and serverless RL from conception to production, working across API, backend, and infrastructure layers.

PyTorchKubernetesvLLMNCCLMegatronDeepSpeed

76

AI-core

Baseten · 73d ago 🔄 synced 38m ago

Post-Training Research Engineer

📍 San Francisco, US · Mid

Post-Training Research Engineer at Baseten building in-house tooling for custom model training. Focus on distributed GPU training, transformer parallelism, and performance optimization across the ML stack.

PyTorchTensorFlowJAXKubernetesRaySlurm

73

AI-fluent

Baseten · 322d ago 🔄 synced 38m ago

Software Engineer - GPU Kernels

📍 San Francisco, US · Mid

GPU Kernel Engineer at Baseten building high-performance CUDA kernels for ML inference optimization. Focus on matrix multiplications, attention mechanisms, and quantization for production AI systems.

CUDAC++PTXNsight SystemsNsight ComputeTorch Profiler

73

AI-fluent

Baseten · 101d ago 🔄 synced 38m ago

Software Engineer — GPU Networking & Distributed Systems

📍 San Francisco, US · Senior

Software engineer at Baseten building GPU networking and distributed inference infrastructure. Focus on RDMA integration, optimizing communication for multi-node LLM serving, and characterizing performance on cutting-edge hardware clusters.

C++PythonNCCLNVSHMEMUCXTensorRT-LLM

72

AI-fluent

Baseten · 630d ago 🔄 synced 38m ago

Engineering Manager - Model Performance

📍 San Francisco, US · Manager

Engineering Manager at Baseten leading a team focused on ML model inference and performance optimization. Responsibilities include managing engineers, optimizing LLM inference stacks, and driving production-scale ML deployment across frameworks like PyTorch, TensorRT, and CUDA.

PythonC++GoPyTorchTensorRTCUDA

72

AI-fluent

Baseten · 27d ago 🔄 synced 38m ago

Manager, Solutions Architect

📍 San Francisco, US 🛠 AI tools welcome at work · Manager

Manager of Solutions Architects at Baseten, an AI inference platform. Lead a team translating customer needs into technical solutions for LLM deployments, run technical discovery and POCs, and mentor architects on model serving optimization.

vLLMsglangTRT-LLMDocker

71

AI-fluent

Baseten · 798d ago 🔄 synced 38m ago

Software Engineer - Model Performance

📍 San Francisco, US · Mid

Software engineer at Baseten optimizing ML model inference performance. Focus on quantization, speculative decoding, and LLM serving infrastructure using PyTorch, TensorRT, and CUDA.

PythonC++PyTorchTensorRTTensorRT-LLMvLLM

71

AI-fluent

Baseten · 3d ago 🔄 synced 38m ago

Software Engineer- BIS (Baseten Inference Stack)

📍 San Francisco, US · Mid

Software Engineer on Baseten's Inference Stack team building distributed runtime for large-scale LLM inference. Work across the stack from developer experience to Kubernetes orchestration and traffic routing, enabling customers to deploy cutting-edge models with industry-leading performance and reliability.

KubernetesvLLMSGLangTensorRT-LLMDynamoPython

67

AI-fluent

Baseten · 44d ago 🔄 synced 38m ago

GTM Engineer

📍 San Francisco, US 🛠 AI tools welcome at work · Mid

GTM Engineer at Baseten building AI-powered workflows for sales, marketing, and support. Owns CRM enrichment strategy, audits and ships productivity tools using Clay, Salesforce, and AI coding assistants.

ClaySalesforceClaude CodeCursorn8nZapier

67

AI-fluent

Baseten · 22d ago 🔄 synced 38m ago

Engineering Manager, Model Library

📍 San Francisco, US · Manager

Engineering Manager leading the Model Library team at Baseten, an AI inference platform. Responsible for building developer-facing APIs, model discovery, and evaluation frameworks for frontier AI models.

APIsSDKsLLM runtimesinference platforms

66

AI-fluent

Baseten · 149d ago 🔄 synced 38m ago

Software Engineer, Model Performance Systems

📍 San Francisco, US · Entry

Software Engineer at Baseten building performance benchmarking and diagnostic tools for LLM inference infrastructure. Focus on GPU profiling, cluster validation, and automated performance testing across hardware systems.

PythonPyTorchNVIDIA Nsight SystemsC++InfiniBandGPU

65

AI-fluent

Baseten · 237d ago 🔄 synced 38m ago

Software Engineer - Model Products

📍 San Francisco, US · Mid

Software engineer at Baseten building Model APIs for LLM inference serving. Focus on distributed systems, GPU optimization, and developer experience for hosted model endpoints.

TensorRT-LLMvLLMSGLangCUDAKubernetesTensorRT

65

AI-fluent

Baseten · 63d ago 🔄 synced 38m ago

Product Manager - Dedicated Inference

📍 San Francisco, US · Mid

Product Manager at Baseten building the core developer experience for AI inference platform. Owns roadmap for APIs, SDKs, UI workflows, and integration surfaces that enable teams to deploy and manage AI applications.

APIsSDKsML inferencemodel training

63

AI-fluent

Baseten · 14d ago 🔄 synced 38m ago

Data Center Network Engineer

📍 San Francisco, US · Senior

Data center network engineer at Baseten designing and operating high-performance GPU cluster infrastructure. Focus on InfiniBand/Ethernet fabric design, topology optimization, and performance validation for AI inference and training workloads.

InfiniBandRDMAEthernetGPU clusters

62

AI-fluent

Baseten · 18d ago 🔄 synced 38m ago

Engineering Manager, Cloud Platform

📍 San Francisco, US · Manager

Engineering Manager at Baseten leading a cloud platform team building inference infrastructure for AI companies. Responsible for team hiring, technical direction, reliability standards, and incident response for production ML systems.

KubernetesTerraformCloudFormationPulumiGitHub ActionsGitLab CI

61

AI-fluent

Baseten · 100d ago 🔄 synced 38m ago

Software Engineer - Baseten for Labs

📍 San Francisco, US · Mid

Full-stack software engineer at Baseten building inference infrastructure and developer-facing products for AI model labs. Focus on API gateways, model discovery, auth, and multi-tenant systems serving frontier AI companies.

ReactTypeScriptKubernetesAPI gatewaysdistributed systems

61

AI-fluent

Baseten · 279d ago 🔄 synced 38m ago

Software Engineer - Training Infrastructure

📍 San Francisco, US · Mid

Software Engineer on Baseten's Training Infrastructure team designing and architecting scalable ML training platforms. Focus on scheduling, storage, networking, and observability for distributed training workloads at scale.

GoPythonKubernetesAWSGCPPyTorch

61

AI-fluent

Baseten · 288d ago 🔄 synced 38m ago

Software Engineer - Enterprise Platform

📍 San Francisco, US · Senior

Senior Enterprise Platform Engineer at Baseten building infrastructure for AI inference. Focus on multi-cloud capacity management, self-hosted clusters, enterprise security, and Kubernetes-based solutions for mission-critical AI workloads.

Kubernetesdistributed systemscloud infrastructureMLOps

61

AI-fluent

Baseten · 695d ago 🔄 synced 38m ago

Software Engineer - Dedicated Inference

📍 San Francisco, US · Mid

Software engineer at Baseten building the developer experience for deploying and operating AI inference workloads in production. Work spans CLI, SDKs, APIs, observability, and debugging tools for mission-critical deployments.

PythonGoJavaScriptReactKubernetesPostgreSQL

61

AI-fluent

Baseten · 24d ago 🔄 synced 38m ago

SRE

📍 San Francisco, US 🛠 AI tools welcome at work · Senior

Site Reliability Engineer at Baseten building observability and automation for ML inference infrastructure. Focus on Kubernetes reliability, incident response, and operational tooling for multi-cloud deployments serving AI companies.

KubernetesTerraformHelmPrometheusVictoriaMetricsGrafana

59

AI-fluent

Baseten · 452d ago 🔄 synced 38m ago

Software Engineer - Infrastructure

📍 San Francisco, US · Mid

Infrastructure Software Engineer at Baseten building ML inference platform components. Focus on Kubernetes deployments, resource management, and monitoring systems for production AI applications.

PythonGoKubernetesPrometheusdistributed systems

57

AI-fluent

Baseten · 18d ago 🔄 synced 38m ago

Senior Manager, Cloud Platform & Site Reliability

📍 San Francisco, US · Director

Senior Manager of Cloud Platform & Site Reliability at Baseten, leading infrastructure and SRE teams. Oversees Kubernetes, multi-cloud infrastructure, and reliability standards for an AI inference platform serving companies like Cursor and Notion.

KubernetesTerraformGitHub ActionsPrometheusGrafanaOpenTelemetry

55

AI-fluent

Baseten · 34d ago 🔄 synced 38m ago

OS / K8s Systems Engineer

📍 San Francisco, US · Senior

OS/Kubernetes systems engineer at Baseten building automation and infrastructure for GPU compute. Focus on cluster provisioning, OS image design, and orchestration systems that enable AI companies to deploy models at scale.

KubernetesPythonGoLinuxPXEGPU

55

AI-fluent

Baseten · 87d ago 🔄 synced 38m ago

Infrastructure Ops Engineer

📍 San Francisco, US · Entry

Infrastructure Ops Engineer at Baseten managing GPU fleet operations and capacity fulfillment for AI inference. Coordinates hardware lifecycles, Kubernetes cluster maintenance, and customer capacity requests across global infrastructure.

KubernetesGPUH100A100B200cloud-native

55

AI-fluent

Baseten · 238d ago 🔄 synced 38m ago

Cloud Platform Engineer

📍 San Francisco, US · Mid

Cloud Platform Engineer at Baseten building scalable infrastructure for ML model deployment and inference. Focus on Kubernetes, CI/CD automation, multi-cloud capacity management, and reliability systems for AI companies.

KubernetesTerraformGitHub ActionsPrometheusGrafanaAWS

55

AI-fluent

Baseten · 36d ago 🔄 synced 38m ago

Engineering Manager, Internal Platform

📍 San Francisco, US 🛠 AI tools welcome at work · Manager

Engineering Manager at Baseten leading the internal platform team. Responsible for building developer tooling, CI/CD infrastructure, and shared libraries that amplify productivity across the engineering organization.

GoPythonKubernetesDockerHelmBazel

53

AI-fluent

Baseten · 78d ago 🔄 synced 38m ago

Data Engineer

📍 San Francisco, US · Mid

Data Engineer at Baseten building internal data platforms and analytics infrastructure for an AI inference company. Design data models, pipelines, and metrics to support product, engineering, and business teams.

Apache BeamKafkaAirflowOpenTelemetryGrafana

53

AI-fluent

Baseten · 25d ago 🔄 synced 38m ago

Solution Architect (AI/LLM Inference)

📍 San Francisco, US 🛠 AI tools welcome at work · Senior

Solution Architect at Baseten guiding customers on AI/LLM inference deployments. Leads technical discovery, demos, benchmarking, and POC execution for companies adopting inference infrastructure at scale.

vLLMSGLangTRT-LLMH100B200

52

AI-fluent

Baseten · 99d ago 🔄 synced 38m ago

Solution Architect

📍 San Francisco, US 🛠 AI tools welcome at work · Mid

Solution Architect at Baseten, an AI inference platform. Partner with sales and customers on technical discovery, demos, and POC execution for ML model deployments. Scope solutions, run benchmarks, and guide customers through inference infrastructure tradeoffs.

vLLMSGLangTRT-LLMPyTorchCUDA

51

AI-fluent

Baseten · 56d ago 🔄 synced 38m ago

Content Engineer

📍 San Francisco, US 🛠 AI tools welcome at work · Mid

Content Engineer at Baseten building technical content and SEO/AEO strategies for AI inference platform. Role combines technical writing, GTM automation, and agentic AI workflows to reach developer audiences.

PythonJavascriptSQLRetoolZapiern8n

45

AI-touching

Baseten · 24d ago 🔄 synced 38m ago

Cost Analytics Lead

📍 San Francisco, US · Senior

Cost Analytics Lead at Baseten building data models and dashboards for cloud infrastructure cost, usage, and capacity tracking across an AI inference platform. Works with Finance, Infrastructure, and Product teams to optimize resource allocation and forecasting.

SQLdbtPythonAWSGoogle CloudSigma

43

AI-touching

Baseten · 2d ago 🔄 synced 38m ago

Head of Legal Operations

📍 San Francisco, US 🛠 AI tools welcome at work · Director

Head of Legal Operations at Baseten, an AI inference platform. Own contract lifecycle management, legal intake workflows, AI-assisted tooling strategy, and reporting systems for a high-growth legal function scaling toward IPO.

Claude CoworkSlack

42

AI-touching

Baseten · 64d ago 🔄 synced 38m ago

Security Engineer

📍 San Francisco, US · Mid

Security Engineer at Baseten building security infrastructure for an ML inference platform. Responsibilities include cloud security architecture, vulnerability management, incident response, IAM, compliance, and DevSecOps integration.

AWSGCPKubernetesSIEMIDSIPS

33

AI-touching

Baseten · 98d ago 🔄 synced 38m ago

Software Engineer - Billing and Internal Tooling

📍 San Francisco, US · Mid

Software engineer at Baseten building billing and revenue infrastructure for an AI inference platform. Owns pricing, invoicing, metering, and internal tooling that supports Finance, Sales, and GTM teams.

OrbPostgreSQLweb stack

33

AI-touching

Baseten · 435d ago 🔄 synced 38m ago

Software Engineer - Internal Platform

📍 San Francisco, US · Mid

Software engineer at Baseten building internal platform infrastructure for the AI inference company. Focus on developer tooling, CI/CD pipelines, monorepo management, and shared libraries to support engineering teams.

GoPythonKubernetesDockerHelmTerraform

33

AI-touching

Baseten · 48d ago 🔄 synced 38m ago

Integrated Marketing Manager

📍 San Francisco, US 🛠 AI tools welcome at work · Mid

Integrated Campaigns Manager at Baseten, an AI inference platform. Own multi-channel marketing campaigns (paid, email, content, events) driving pipeline and go-to-market momentum for developer-focused AI companies.

HubSpotClayApollo

30

AI-touching

Baseten · 98d ago 🔄 synced 38m ago

Performance Marketing Manager

📍 San Francisco, US · Mid

Performance Marketing Manager at Baseten, an AI inference platform. Own paid acquisition strategy across Google, LinkedIn, X, Reddit targeting ML engineers and AI builders. Build data-driven growth engine with funnel analytics and campaign optimization.

Google AdsLinkedInXReddit

27

AI-touching

Baseten · 177d ago 🔄 synced 38m ago

Manager, Startup Sales

📍 San Francisco, US · Manager

Sales Manager at Baseten leading a team of Account Executives selling AI infrastructure to startups. Responsible for team coaching, revenue growth, and cross-functional collaboration with product and engineering.

24

AI-touching

Baseten · 93d ago 🔄 synced 38m ago

Onboarding Program Manager

📍 San Francisco, US · Manager

Onboarding Program Manager at Baseten, an AI inference platform. Build and lead end-to-end onboarding for dozens of monthly hires, designing curriculum, facilitating learning, and enabling managers to accelerate ramp time in a technical environment.

23

AI-touching

Baseten · 38d ago 🔄 synced 38m ago

Infrastructure Finance Lead

📍 San Francisco, US · Senior

Infrastructure Finance Lead at Baseten, an AI inference platform. Own the P&L for compute infrastructure, manage cloud and GPU capacity forecasts, and partner with engineering teams on cost optimization and strategic capacity decisions.

AWSGCPGPU procurement

22

AI-touching

Baseten · 56d ago 🔄 synced 38m ago

Field Operations & Incentives Manager

📍 San Francisco, US · Manager

Field Operations & Incentives Manager at Baseten, an AI inference platform. Owns sales compensation design, territory management, quota administration, and RevOps governance for a high-growth SaaS company.

SalesforceExcelGoogle SheetsCaptivateIQEverstageSpiff

21

AI-touching

Baseten · 127d ago 🔄 synced 38m ago

Manager, Strategic Sales

📍 San Francisco, US · Manager

Sales Manager at Baseten leading a team of Strategic Account Executives selling AI infrastructure to frontier AI companies. Requires 6+ years closing sales experience with 2+ years in management, strong technical acumen in AI/cloud infrastructure, and based in San Francisco or New York with 3 days/week office requirement.

17

AI-touching

Baseten · 37d ago 🔄 synced 38m ago

Strategic Finance, GTM

📍 San Francisco, US · Senior

Strategic Finance, GTM lead at Baseten, an AI inference platform. Own revenue forecasting, sales capacity planning, quota setting, and GTM operating model for a consumption-based business serving AI companies.

16

AI-touching

Baseten · 17d ago 🔄 synced 38m ago

Recruiting Operations Lead

📍 San Francisco, US · Manager

Recruiting Operations Lead at Baseten, an AI inference platform. Build and scale recruiting processes, data infrastructure, and hiring programs for a fast-growing AI company.

Ashby

14

AI-touching

Baseten · 92d ago 🔄 synced 38m ago

People Business Partner, GTM

📍 San Francisco, US · Senior

People Business Partner supporting Baseten's Go-to-Market organization through rapid growth. Focus on org design, talent strategy, performance management, and leadership development for GTM teams scaling from 400+ employees.

12

AI-touching

Baseten · 93d ago 🔄 synced 38m ago

Senior Compensation Manager

📍 San Francisco, US · Senior

Senior Compensation Manager at Baseten, an AI inference platform. Owns company-wide compensation strategy, equity programs, leveling frameworks, and market benchmarking for a high-growth AI company.

ExcelGoogle SheetsHRIS

10

AI-touching

Baseten

Open roles