← All companies
Company

Baseten

53 open roles scored by AI-agency.

53
Open roles
53
Avg AI-Agency
0
Remote-friendly

Open roles

Baseten · 🔄 synced 11h ago
Software Engineer - Voice AI (Inference Runtime)
📍 San Francisco, US 🛠 AI tools welcome at work · Senior
Software engineer at Baseten building the Voice AI inference runtime stack. Focus on model serving, real-time latency optimization, and large-scale infrastructure for STT, TTS, and voice agent workloads.
PythonvLLMTensorRTONNXDockerKubernetes
83
AI-core
Baseten · 🔄 synced 11h ago
Post-Training Research Scientist
📍 San Francisco, US · Senior
Post-Training Research Scientist at Baseten, an AI inference platform. Pursue foundational and applied research in post-training methodology and inference optimization, with direct collaboration on production systems serving major AI companies.
PyTorchJAXTensorFlowTransformersvLLM
83
AI-core
Baseten · 🔄 synced 11h ago
Software Engineer - AI Enablement
📍 San Francisco, US 🛠 AI tools welcome at work · Mid
Software engineer at Baseten building internal AI agents and LLM-powered workflows to boost engineering productivity. Focus on evaluating, deploying, and customizing AI coding assistants and custom agents for the engineering org.
PythonLLM agentsClaude CodeCursorCodex
80
AI-core
Baseten · 🔄 synced 11h ago
Engineering Manager - Forward Deployed Engineering (LLM)
📍 San Francisco, US 🛠 AI tools welcome at work · Manager
Engineering Manager at Baseten leading a Forward Deployed Engineering team building and optimizing LLM inference systems for production customers. Hands-on player-coach role combining team leadership with direct technical contribution to customer deployments and core platform features.
PythonvLLMTensorRTTritonHugging FaceRay Serve
79
AI-core
Baseten · 🔄 synced 11h ago
Applied AI Inference Engineer
📍 San Francisco, US 🛠 AI tools welcome at work · Mid
Applied AI Inference Engineer at Baseten building production AI systems for customers. Role combines hands-on Python development, customer-facing implementation, and product engineering to deploy high-scale inference applications on Baseten's platform.
PythonDockerML model deploymentinference optimization
77
AI-core
Baseten · 🔄 synced 11h ago
AI Solutions Engineer
📍 San Francisco, US 🛠 AI tools welcome at work · Mid
AI Solutions Engineer at Baseten, an inference platform for AI companies. Role combines hands-on Python development with customer-facing implementation, deploying production AI systems end-to-end from design through monitoring.
PythonDockerComfyUIWhisper
77
AI-core
Baseten · 🔄 synced 11h ago
Forward Deployed Engineer
📍 San Francisco, US 🛠 AI tools welcome at work · Mid
Forward Deployed Engineer at Baseten, an AI inference platform. You'll architect and deploy production AI applications for customers like Cursor and Notion, combining hands-on coding with customer-facing implementation and product work.
PythonDockerML model deploymentAPI development
77
AI-core
Baseten · 🔄 synced 11h ago
Software Engineer - Training Product
📍 San Francisco, US 🛠 AI tools welcome at work · Senior
Software engineer at Baseten building training infrastructure for AI companies. Own features like multi-node training and serverless RL from conception to production, working across API, backend, and infrastructure layers.
PyTorchKubernetesvLLMNCCLMegatronDeepSpeed
76
AI-core
Baseten · 🔄 synced 11h ago
Post-Training Research Engineer
📍 San Francisco, US · Mid
Post-Training Research Engineer at Baseten building in-house tooling for custom model training. Focus on distributed GPU training, transformer parallelism, and performance optimization across the ML stack.
PyTorchTensorFlowJAXKubernetesRaySlurm
73
AI-fluent
Baseten · 🔄 synced 11h ago
Software Engineer - GPU Kernels
📍 San Francisco, US · Mid
GPU Kernel Engineer at Baseten building high-performance CUDA kernels for ML inference optimization. Focus on matrix multiplications, attention mechanisms, and quantization for production AI systems.
CUDAC++PTXNsight SystemsNsight ComputeTorch Profiler
73
AI-fluent
Baseten · 🔄 synced 11h ago
Software Engineer — GPU Networking & Distributed Systems
📍 San Francisco, US · Senior
Software engineer at Baseten building GPU networking and distributed inference infrastructure. Focus on RDMA integration, optimizing communication for multi-node LLM serving, and characterizing performance on cutting-edge hardware clusters.
C++PythonNCCLNVSHMEMUCXTensorRT-LLM
72
AI-fluent
Baseten · 🔄 synced 11h ago
Engineering Manager - Model Performance
📍 San Francisco, US · Manager
Engineering Manager at Baseten leading a team focused on ML model inference and performance optimization. Responsibilities include managing engineers, optimizing LLM inference stacks, and driving production-scale ML deployment across frameworks like PyTorch, TensorRT, and CUDA.
PythonC++GoPyTorchTensorRTCUDA
72
AI-fluent
Baseten · 🔄 synced 11h ago
Manager, Solutions Architect
📍 San Francisco, US 🛠 AI tools welcome at work · Manager
Manager of Solutions Architects at Baseten, an AI inference platform. Lead a team translating customer needs into technical solutions for LLM deployments, run technical discovery and POCs, and mentor architects on model serving optimization.
vLLMsglangTRT-LLMDocker
71
AI-fluent
Baseten · 🔄 synced 11h ago
Software Engineer - Model Performance
📍 San Francisco, US · Mid
Software engineer at Baseten optimizing ML model inference performance. Focus on quantization, speculative decoding, and LLM serving infrastructure using PyTorch, TensorRT, and CUDA.
PythonC++PyTorchTensorRTTensorRT-LLMvLLM
71
AI-fluent
Baseten · 🔄 synced 11h ago
Software Engineer- BIS (Baseten Inference Stack)
📍 San Francisco, US · Mid
Software Engineer on Baseten's Inference Stack team building distributed runtime for large-scale LLM inference. Work across the stack from developer experience to Kubernetes orchestration and traffic routing, enabling customers to deploy cutting-edge models with industry-leading performance and reliability.
KubernetesvLLMSGLangTensorRT-LLMDynamoPython
67
AI-fluent
Baseten · 🔄 synced 11h ago
GTM Engineer
📍 San Francisco, US 🛠 AI tools welcome at work · Mid
GTM Engineer at Baseten building AI-powered workflows for sales, marketing, and support. Owns CRM enrichment strategy, audits and ships productivity tools using Clay, Salesforce, and AI coding assistants.
ClaySalesforceClaude CodeCursorn8nZapier
67
AI-fluent
Baseten · 🔄 synced 11h ago
Engineering Manager, Model Library
📍 San Francisco, US · Manager
Engineering Manager leading the Model Library team at Baseten, an AI inference platform. Responsible for building developer-facing APIs, model discovery, and evaluation frameworks for frontier AI models.
APIsSDKsLLM runtimesinference platforms
66
AI-fluent
Baseten · 🔄 synced 11h ago
Software Engineer, Model Performance Systems
📍 San Francisco, US · Entry
Software Engineer at Baseten building performance benchmarking and diagnostic tools for LLM inference infrastructure. Focus on GPU profiling, cluster validation, and automated performance testing across hardware systems.
PythonPyTorchNVIDIA Nsight SystemsC++InfiniBandGPU
65
AI-fluent
Baseten · 🔄 synced 11h ago
Software Engineer - Model Products
📍 San Francisco, US · Mid
Software engineer at Baseten building Model APIs for LLM inference serving. Focus on distributed systems, GPU optimization, and developer experience for hosted model endpoints.
TensorRT-LLMvLLMSGLangCUDAKubernetesTensorRT
65
AI-fluent
Baseten · 🔄 synced 11h ago
Product Manager - Dedicated Inference
📍 San Francisco, US · Mid
Product Manager at Baseten building the core developer experience for AI inference platform. Owns roadmap for APIs, SDKs, UI workflows, and integration surfaces that enable teams to deploy and manage AI applications.
APIsSDKsML inferencemodel training
63
AI-fluent
Baseten · 🔄 synced 11h ago
Data Center Network Engineer
📍 San Francisco, US · Senior
Data center network engineer at Baseten designing and operating high-performance GPU cluster infrastructure. Focus on InfiniBand/Ethernet fabric design, topology optimization, and performance validation for AI inference and training workloads.
InfiniBandRDMAEthernetGPU clusters
62
AI-fluent
Baseten · 🔄 synced 11h ago
Engineering Manager, Cloud Platform
📍 San Francisco, US · Manager
Engineering Manager at Baseten leading a cloud platform team building inference infrastructure for AI companies. Responsible for team hiring, technical direction, reliability standards, and incident response for production ML systems.
KubernetesTerraformCloudFormationPulumiGitHub ActionsGitLab CI
61
AI-fluent
Baseten · 🔄 synced 11h ago
Software Engineer - Baseten for Labs
📍 San Francisco, US · Mid
Full-stack software engineer at Baseten building inference infrastructure and developer-facing products for AI model labs. Focus on API gateways, model discovery, auth, and multi-tenant systems serving frontier AI companies.
ReactTypeScriptKubernetesAPI gatewaysdistributed systems
61
AI-fluent
Baseten · 🔄 synced 11h ago
Software Engineer - Training Infrastructure
📍 San Francisco, US · Mid
Software Engineer on Baseten's Training Infrastructure team designing and architecting scalable ML training platforms. Focus on scheduling, storage, networking, and observability for distributed training workloads at scale.
GoPythonKubernetesAWSGCPPyTorch
61
AI-fluent
Baseten · 🔄 synced 11h ago
Software Engineer - Enterprise Platform
📍 San Francisco, US · Senior
Senior Enterprise Platform Engineer at Baseten building infrastructure for AI inference. Focus on multi-cloud capacity management, self-hosted clusters, enterprise security, and Kubernetes-based solutions for mission-critical AI workloads.
Kubernetesdistributed systemscloud infrastructureMLOps
61
AI-fluent
Baseten · 🔄 synced 11h ago
Software Engineer - Dedicated Inference
📍 San Francisco, US · Mid
Software engineer at Baseten building the developer experience for deploying and operating AI inference workloads in production. Work spans CLI, SDKs, APIs, observability, and debugging tools for mission-critical deployments.
PythonGoJavaScriptReactKubernetesPostgreSQL
61
AI-fluent
Baseten · 🔄 synced 11h ago
SRE
📍 San Francisco, US 🛠 AI tools welcome at work · Senior
Site Reliability Engineer at Baseten building observability and automation for ML inference infrastructure. Focus on Kubernetes reliability, incident response, and operational tooling for multi-cloud deployments serving AI companies.
KubernetesTerraformHelmPrometheusVictoriaMetricsGrafana
59
AI-fluent
Baseten · 🔄 synced 11h ago
Software Engineer - Infrastructure
📍 San Francisco, US · Mid
Infrastructure Software Engineer at Baseten building ML inference platform components. Focus on Kubernetes deployments, resource management, and monitoring systems for production AI applications.
PythonGoKubernetesPrometheusdistributed systems
57
AI-fluent
Baseten · 🔄 synced 11h ago
Senior Manager, Cloud Platform & Site Reliability
📍 San Francisco, US · Director
Senior Manager of Cloud Platform & Site Reliability at Baseten, leading infrastructure and SRE teams. Oversees Kubernetes, multi-cloud infrastructure, and reliability standards for an AI inference platform serving companies like Cursor and Notion.
KubernetesTerraformGitHub ActionsPrometheusGrafanaOpenTelemetry
55
AI-fluent
Baseten · 🔄 synced 11h ago
OS / K8s Systems Engineer
📍 San Francisco, US · Senior
OS/Kubernetes systems engineer at Baseten building automation and infrastructure for GPU compute. Focus on cluster provisioning, OS image design, and orchestration systems that enable AI companies to deploy models at scale.
KubernetesPythonGoLinuxPXEGPU
55
AI-fluent
Baseten · 🔄 synced 11h ago
Infrastructure Ops Engineer
📍 San Francisco, US · Entry
Infrastructure Ops Engineer at Baseten managing GPU fleet operations and capacity fulfillment for AI inference. Coordinates hardware lifecycles, Kubernetes cluster maintenance, and customer capacity requests across global infrastructure.
KubernetesGPUH100A100B200cloud-native
55
AI-fluent
Baseten · 🔄 synced 11h ago
Cloud Platform Engineer
📍 San Francisco, US · Mid
Cloud Platform Engineer at Baseten building scalable infrastructure for ML model deployment and inference. Focus on Kubernetes, CI/CD automation, multi-cloud capacity management, and reliability systems for AI companies.
KubernetesTerraformGitHub ActionsPrometheusGrafanaAWS
55
AI-fluent
Baseten · 🔄 synced 11h ago
Engineering Manager, Internal Platform
📍 San Francisco, US 🛠 AI tools welcome at work · Manager
Engineering Manager at Baseten leading the internal platform team. Responsible for building developer tooling, CI/CD infrastructure, and shared libraries that amplify productivity across the engineering organization.
GoPythonKubernetesDockerHelmBazel
53
AI-fluent
Baseten · 🔄 synced 11h ago
Data Engineer
📍 San Francisco, US · Mid
Data Engineer at Baseten building internal data platforms and analytics infrastructure for an AI inference company. Design data models, pipelines, and metrics to support product, engineering, and business teams.
Apache BeamKafkaAirflowOpenTelemetryGrafana
53
AI-fluent
Baseten · 🔄 synced 11h ago
Solution Architect (AI/LLM Inference)
📍 San Francisco, US 🛠 AI tools welcome at work · Senior
Solution Architect at Baseten guiding customers on AI/LLM inference deployments. Leads technical discovery, demos, benchmarking, and POC execution for companies adopting inference infrastructure at scale.
vLLMSGLangTRT-LLMH100B200
52
AI-fluent
Baseten · 🔄 synced 11h ago
Solution Architect
📍 San Francisco, US 🛠 AI tools welcome at work · Mid
Solution Architect at Baseten, an AI inference platform. Partner with sales and customers on technical discovery, demos, and POC execution for ML model deployments. Scope solutions, run benchmarks, and guide customers through inference infrastructure tradeoffs.
vLLMSGLangTRT-LLMPyTorchCUDA
51
AI-fluent
Baseten · 🔄 synced 11h ago
Content Engineer
📍 San Francisco, US 🛠 AI tools welcome at work · Mid
Content Engineer at Baseten building technical content and SEO/AEO strategies for AI inference platform. Role combines technical writing, GTM automation, and agentic AI workflows to reach developer audiences.
PythonJavascriptSQLRetoolZapiern8n
45
AI-touching
Baseten · 🔄 synced 11h ago
Cost Analytics Lead
📍 San Francisco, US · Senior
Cost Analytics Lead at Baseten building data models and dashboards for cloud infrastructure cost, usage, and capacity tracking across an AI inference platform. Works with Finance, Infrastructure, and Product teams to optimize resource allocation and forecasting.
SQLdbtPythonAWSGoogle CloudSigma
43
AI-touching
Baseten · 🔄 synced 11h ago
Head of Legal Operations
📍 San Francisco, US 🛠 AI tools welcome at work · Director
Head of Legal Operations at Baseten, an AI inference platform. Own contract lifecycle management, legal intake workflows, AI-assisted tooling strategy, and reporting systems for a high-growth legal function scaling toward IPO.
Claude CoworkSlack
42
AI-touching
Baseten · 🔄 synced 11h ago
Security Engineer
📍 San Francisco, US · Mid
Security Engineer at Baseten building security infrastructure for an ML inference platform. Responsibilities include cloud security architecture, vulnerability management, incident response, IAM, compliance, and DevSecOps integration.
AWSGCPKubernetesSIEMIDSIPS
33
AI-touching
Baseten · 🔄 synced 11h ago
Software Engineer - Billing and Internal Tooling
📍 San Francisco, US · Mid
Software engineer at Baseten building billing and revenue infrastructure for an AI inference platform. Owns pricing, invoicing, metering, and internal tooling that supports Finance, Sales, and GTM teams.
OrbPostgreSQLweb stack
33
AI-touching
Baseten · 🔄 synced 11h ago
Software Engineer - Internal Platform
📍 San Francisco, US · Mid
Software engineer at Baseten building internal platform infrastructure for the AI inference company. Focus on developer tooling, CI/CD pipelines, monorepo management, and shared libraries to support engineering teams.
GoPythonKubernetesDockerHelmTerraform
33
AI-touching
Baseten · 🔄 synced 11h ago
Integrated Marketing Manager
📍 San Francisco, US 🛠 AI tools welcome at work · Mid
Integrated Campaigns Manager at Baseten, an AI inference platform. Own multi-channel marketing campaigns (paid, email, content, events) driving pipeline and go-to-market momentum for developer-focused AI companies.
HubSpotClayApollo
30
AI-touching
Baseten · 🔄 synced 11h ago
Performance Marketing Manager
📍 San Francisco, US · Mid
Performance Marketing Manager at Baseten, an AI inference platform. Own paid acquisition strategy across Google, LinkedIn, X, Reddit targeting ML engineers and AI builders. Build data-driven growth engine with funnel analytics and campaign optimization.
Google AdsLinkedInXReddit
27
AI-touching
Baseten · 🔄 synced 11h ago
Manager, Startup Sales
📍 San Francisco, US · Manager
Sales Manager at Baseten leading a team of Account Executives selling AI infrastructure to startups. Responsible for team coaching, revenue growth, and cross-functional collaboration with product and engineering.
24
AI-touching
Baseten · 🔄 synced 11h ago
Onboarding Program Manager
📍 San Francisco, US · Manager
Onboarding Program Manager at Baseten, an AI inference platform. Build and lead end-to-end onboarding for dozens of monthly hires, designing curriculum, facilitating learning, and enabling managers to accelerate ramp time in a technical environment.
23
AI-touching
Baseten · 🔄 synced 11h ago
Infrastructure Finance Lead
📍 San Francisco, US · Senior
Infrastructure Finance Lead at Baseten, an AI inference platform. Own the P&L for compute infrastructure, manage cloud and GPU capacity forecasts, and partner with engineering teams on cost optimization and strategic capacity decisions.
AWSGCPGPU procurement
22
AI-touching
Baseten · 🔄 synced 11h ago
Field Operations & Incentives Manager
📍 San Francisco, US · Manager
Field Operations & Incentives Manager at Baseten, an AI inference platform. Owns sales compensation design, territory management, quota administration, and RevOps governance for a high-growth SaaS company.
SalesforceExcelGoogle SheetsCaptivateIQEverstageSpiff
21
AI-touching
Baseten · 🔄 synced 11h ago
Manager, Strategic Sales
📍 San Francisco, US · Manager
Sales Manager at Baseten leading a team of Strategic Account Executives selling AI infrastructure to frontier AI companies. Requires 6+ years closing sales experience with 2+ years in management, strong technical acumen in AI/cloud infrastructure, and based in San Francisco or New York with 3 days/week office requirement.
17
AI-touching
Baseten · 🔄 synced 11h ago
Strategic Finance, GTM
📍 San Francisco, US · Senior
Strategic Finance, GTM lead at Baseten, an AI inference platform. Own revenue forecasting, sales capacity planning, quota setting, and GTM operating model for a consumption-based business serving AI companies.
16
AI-touching
Baseten · 🔄 synced 11h ago
Recruiting Operations Lead
📍 San Francisco, US · Manager
Recruiting Operations Lead at Baseten, an AI inference platform. Build and scale recruiting processes, data infrastructure, and hiring programs for a fast-growing AI company.
Ashby
14
AI-touching
Baseten · 🔄 synced 11h ago
People Business Partner, GTM
📍 San Francisco, US · Senior
People Business Partner supporting Baseten's Go-to-Market organization through rapid growth. Focus on org design, talent strategy, performance management, and leadership development for GTM teams scaling from 400+ employees.
12
AI-touching
Baseten · 🔄 synced 11h ago
Senior Compensation Manager
📍 San Francisco, US · Senior
Senior Compensation Manager at Baseten, an AI inference platform. Owns company-wide compensation strategy, equity programs, leveling frameworks, and market benchmarking for a high-growth AI company.
ExcelGoogle SheetsHRIS
10
AI-touching