Baseten
Engineering Manager - Model Performance
📍 San Francisco, US
· Manager
Engineering Manager at Baseten leading a team focused on ML model inference and performance optimization. Responsibilities include managing engineers, optimizing LLM inference stacks, and driving production-scale ML deployment with tools such as PyTorch, TensorRT, and CUDA.
Python · C++ · Go · PyTorch · TensorRT · CUDA