← All jobs · Fireworks AI

Member of Technical Staff, Performance Optimization

Fireworks AI ·
71
AI-Agency
B88 U45
📍 San Mateo, US Senior 5+ yrs
CUDATritonPyTorchROCmKubernetesInfinibandRoCE
TL;DR

Member of Technical Staff, Performance Optimization at Fireworks AI. Optimize GPU kernels, distributed systems, and inference/training pipelines for large language models and vision models. Focus on latency, throughput, and cost-efficiency across multi-GPU environments.

Apply at Fireworks AI →
share:
you'll be redirected to the company's career page

Job description

The Role: 

We're looking for a Software Engineer focused on Performance Optimization to help push the boundaries of speed and efficiency across our AI infrastructure. In this role, you'll take ownership of optimizing performance at every layer of the stack—from low-level GPU kernels to large-scale distributed systems. A key focus will be maximizing the performance of our most demanding workloads, including large language models (LLMs), vision-language models (VLMs), and next-generation video models.

You’ll work closely with teams across research, infrastructure, and systems to identify performance bottlenecks, implement cutting-edge optimizations, and scale our AI systems to meet the demands of real-world production use cases. Your work will directly impact the speed, scalability, and cost-effectiveness of some of the most advanced generative AI models in the world.

Key Responsibilities:

Minimum Qualifications:

Preferred Qualifications:

Example projects:

Apply at Fireworks AI →

More open roles at Fireworks AI

Fireworks AI ⚡ AI-native · 🔄 synced 3h ago
Applied Machine Learning Engineer
📍 San Mateo, US · Mid
Applied Machine Learning Engineer at Fireworks AI building and deploying customer-facing AI applications. Responsibilities include fine-tuning models, developing PoCs, integrating new models, and optimizing performance for enterprise clients.
Pythonmachine learninggenerative AIfine-tuningRLHF
77
AI-core
Fireworks AI ⚡ AI-native · 🔄 synced 3h ago
Member of Technical Staff, Evals & Post-Training Product
📍 San Mateo, US 🛠 AI tools welcome at work · Mid
Member of Technical Staff at Fireworks AI building evaluation and post-training product experiences. Role spans backend systems, SDKs, and web interfaces to help developers improve models through evals and fine-tuning workflows.
PythonLLMPyTorchAPISDKbackend systems
76
AI-core
Fireworks AI ⚡ AI-native · 🔄 synced 3h ago
Member of Technical Staff, Research
📍 San Mateo, US · Mid
Member of Technical Staff on the Research team at Fireworks AI conducting foundational research to advance LLMs and multimodal systems. Focus on model efficiency, novel architectures, training methods, and transitioning research into production systems.
PythonC++PyTorchTensorFlowJAX
76
AI-core
Fireworks AI ⚡ AI-native · 🔄 synced 3h ago
Member of Technical Staff, Software Engineer
📍 San Mateo, US 🛠 AI tools welcome at work · Senior
Backend engineer at Fireworks AI building platform systems for AI model orchestration, fine-tuning, billing, and developer APIs. Focus on scalable services, product impact, and enterprise features.
73
AI-fluent
Fireworks AI ⚡ AI-native · 🔄 synced 3h ago
Solutions Architect
📍 San Mateo, US 🛠 AI tools welcome at work · Senior
Solutions Architect at Fireworks AI, a generative AI infrastructure company. Lead technical discovery, design AI solutions, execute POCs, and manage customer relationships from initial engagement through production deployment.
PythonAWSAzureGCPLLM inferencefine-tuning
73
AI-fluent