← All companies
Company

Baseten

24 open roles scored by AI-agency.

24
Open roles
66
Avg AI-Agency
0
Remote-friendly

Open roles

Baseten · 🔄 synced 1h ago
Post-Training Applied Researcher
📍 San Francisco, US 🛠 AI tools welcome at work · Senior
Post-training applied researcher at Baseten building reward functions, training pipelines, and eval harnesses to fine-tune open-source LLMs for production use. Work directly with customers on domain-specific model training spanning healthcare, code generation, and agentic workflows.
PyTorchJAXGRPODPOPPORL
89
AI-core
Baseten · 🔄 synced 1h ago
Post-Training Research Scientist
📍 San Francisco, US · Senior
Post-training research scientist at Baseten, an AI inference platform. Role spans pure research on model learning and alignment, plus applied work on training methodology and inference optimization for production systems serving companies like Cursor and Notion.
PyTorchJAXTensorFlowvLLMTriton
85
AI-core
Baseten · 🔄 synced 1h ago
Software Engineer - AI Enablement
📍 San Francisco, US 🛠 AI tools welcome at work · Mid
Software engineer at Baseten building internal AI agents and LLM-powered workflows to boost engineering productivity. Focus on evaluating, deploying, and customizing AI coding assistants and custom agents for the engineering org.
PythonLLM agentsClaude CodeCursorCodex
80
AI-core
Baseten · 🔄 synced 1h ago
Forward Deployed Engineer
📍 San Francisco, US 🛠 AI tools welcome at work · Mid
Forward Deployed Engineer at Baseten, an AI inference platform. You'll architect and deploy production AI applications for customers like Cursor and Notion, combining hands-on coding with customer-facing implementation and product work.
PythonDockerML model deploymentAPI development
77
AI-core
Baseten · 🔄 synced 1h ago
Software Engineer - Training Product
📍 San Francisco, US 🛠 AI tools welcome at work · Senior
Software engineer at Baseten building training infrastructure for AI companies. Own features like multi-node training and serverless RL from conception to production, working across API, backend, and infrastructure layers.
PyTorchKubernetesvLLMNCCLMegatronDeepSpeed
76
AI-core
Baseten · 🔄 synced 1h ago
Post-Training Research Engineer
📍 San Francisco, US · Mid
Post-Training Research Engineer at Baseten building in-house tooling for custom model training. Focus on distributed GPU training, transformer parallelism, and performance optimization across the ML stack.
PyTorchTensorFlowJAXKubernetesRaySlurm
73
AI-fluent
Baseten · 🔄 synced 1h ago
Software Engineer - GPU Kernels
📍 San Francisco, US · Mid
GPU Kernel Engineer at Baseten building high-performance CUDA kernels for ML inference optimization. Focus on matrix multiplications, attention mechanisms, and quantization for production AI systems.
CUDAC++PTXNsight SystemsNsight ComputeTorch Profiler
73
AI-fluent
Baseten · 🔄 synced 1h ago
Software Engineer — GPU Networking & Distributed Systems
📍 San Francisco, US · Senior
Software engineer at Baseten building GPU networking and distributed inference infrastructure. Focus on RDMA integration, optimizing communication for multi-node LLM serving, and characterizing performance on cutting-edge hardware clusters.
C++PythonNCCLNVSHMEMUCXTensorRT-LLM
72
AI-fluent
Baseten · 🔄 synced 1h ago
Engineering Manager - Model Performance
📍 San Francisco, US · Manager
Engineering Manager at Baseten leading a team focused on ML model inference and performance optimization. Responsibilities include managing engineers, optimizing LLM inference stacks, and driving production-scale ML deployment across frameworks like PyTorch, TensorRT, and CUDA.
PythonC++GoPyTorchTensorRTCUDA
72
AI-fluent
Baseten · 🔄 synced 1h ago
Software Engineer - Model Performance
📍 San Francisco, US · Mid
Software engineer at Baseten optimizing ML model inference performance. Focus on quantization, speculative decoding, and LLM serving infrastructure using PyTorch, TensorRT, and CUDA.
PythonC++PyTorchTensorRTTensorRT-LLMvLLM
71
AI-fluent
Baseten · 🔄 synced 1h ago
Software Engineer - Model APIs
📍 San Francisco, US · Mid
Software engineer at Baseten building Model APIs infrastructure for LLM serving. Focus on distributed systems, GPU optimization, inference performance, and developer-facing API platforms.
TensorRT-LLMvLLMSGLangCUDAKubernetesTensorRT
67
AI-fluent
Baseten · 🔄 synced 1h ago
Software Engineer, Model Performance Tooling
📍 San Francisco, US · Entry
Software engineer at Baseten building performance benchmarking and diagnostic tools for LLM inference infrastructure. Focus on GPU profiling, cluster validation, and automated performance testing across hardware stacks.
PythonPyTorchNVIDIA Nsight SystemsC++InfiniBandGPU
65
AI-fluent
Baseten · 🔄 synced 1h ago
Software Engineer - Model Developer Ecosystem
📍 San Francisco, US 🛠 AI tools welcome at work · Mid
Software Engineer - Model Developer Ecosystem at Baseten, an AI inference platform. Build model library guides, evaluation frameworks, and developer education to help engineers select the right models for their use cases.
PythonLLMinference platformsAPI
63
AI-fluent
Baseten · 🔄 synced 1h ago
Product Manager - Core Product
📍 San Francisco, US · Mid
Product Manager at Baseten shaping the core developer experience for AI inference platform. Owns roadmap for APIs, SDKs, and UI workflows that enable teams to build and deploy AI applications.
APIsSDKsML inferencemodel training
61
AI-fluent
Baseten · 🔄 synced 1h ago
Software Engineer - Training Infrastructure
📍 San Francisco, US · Mid
Software Engineer on Baseten's Training Infrastructure team designing and architecting scalable ML training platforms. Focus on scheduling, storage, networking, and observability for distributed training workloads at scale.
GoPythonKubernetesAWSGCPPyTorch
61
AI-fluent
Baseten · 🔄 synced 1h ago
Software Engineer - Enterprise Platform
📍 San Francisco, US · Senior
Senior Enterprise Platform Engineer at Baseten building infrastructure for AI inference. Focus on multi-cloud capacity management, self-hosted clusters, enterprise security, and Kubernetes-based solutions for mission-critical AI workloads.
Kubernetesdistributed systemscloud infrastructureMLOps
61
AI-fluent
Baseten · 🔄 synced 1h ago
Software Engineer - Core Product
📍 San Francisco, US · Mid
Software engineer at Baseten building core product infrastructure for ML model deployment. Work spans CLI tools, REST APIs, web applications, and Kubernetes infrastructure serving AI companies like Cursor and Notion.
PythonGoJavaScriptReactKubernetesREST APIs
61
AI-fluent
Baseten · 🔄 synced 1h ago
Software Engineer - New Products
📍 San Francisco, US · Mid
Software engineer at Baseten building core platform capabilities for AI inference. Focus on API gateways, auth, quotas, metering, and multi-tenant isolation for production AI services.
KubernetesReactTypeScriptAPI gatewaysservice meshes
59
AI-fluent
Baseten · 🔄 synced 1h ago
Software Engineer - Infrastructure
📍 San Francisco, US · Mid
Infrastructure Software Engineer at Baseten building ML inference platform components. Focus on Kubernetes deployments, resource management, and monitoring systems for production AI applications.
PythonGoKubernetesPrometheusdistributed systems
57
AI-fluent
Baseten · 🔄 synced 1h ago
SRE
📍 San Francisco, US · Mid
SRE at Baseten managing technical success for enterprise customers running ML inference workloads. Diagnose and resolve production issues across Kubernetes, networking, and model serving infrastructure.
KubernetesGrafanaLokiPrometheusHelmFlux
56
AI-fluent
Baseten · 🔄 synced 1h ago
Infrastructure Ops Engineer
📍 San Francisco, US · Entry
Infrastructure Ops Engineer at Baseten managing GPU fleet operations and capacity fulfillment for AI inference. Coordinates hardware lifecycles, Kubernetes cluster maintenance, and customer capacity requests across global infrastructure.
KubernetesGPUH100A100B200cloud-native
55
AI-fluent
Baseten · 🔄 synced 1h ago
Data Engineer
📍 San Francisco, US · Mid
Data Engineer at Baseten building internal data platforms and analytics infrastructure for an AI inference company. Design data models, pipelines, and metrics to support product, engineering, and business teams.
Apache BeamKafkaAirflowOpenTelemetryGrafana
53
AI-fluent
Baseten · 🔄 synced 1h ago
Solution Architect
📍 San Francisco, US 🛠 AI tools welcome at work · Mid
Solution Architect at Baseten, an AI inference platform. Partner with sales and customers on technical discovery, demos, and POC execution for ML model deployments. Scope solutions, run benchmarks, and guide customers through inference infrastructure tradeoffs.
vLLMSGLangTRT-LLMPyTorchCUDA
51
AI-fluent
Baseten · 🔄 synced 1h ago
Site Reliability Engineer (SRE)
📍 San Francisco, US · Mid
Site Reliability Engineer at Baseten building scalable infrastructure for AI model inference and deployment. Focus on Kubernetes, CI/CD automation, multi-cloud capacity management, and operational excellence for mission-critical ML systems.
KubernetesTerraformGitHub ActionsPrometheusGrafanaELK stack
51
AI-fluent