← All jobs · Stability AI

Multimodal Generative AI Researcher

Stability AI ·
78
AI-Agency
B95 U50
🌐 Remote-only Senior
PyTorchDeepSpeedRayCLIPLoRANeRFGaussian splatting
TL;DR

Research Scientist at Stability AI designing and fine-tuning large-scale Vision-Language Models for multimodal tasks including visual reasoning, 3D understanding, and embodied interaction. Bridges research breakthroughs with scalable training pipelines and production deployment.

Apply at Stability AI →
share:
you'll be redirected to the company's career page

Job description

Multimodal Generative AI Researcher

Location: Remote 

About the Role

We’re looking for a Research Scientist with deep expertise in training and fine-tuning large Vision-Language and Language Models (VLMs / LLMs) for downstream multimodal tasks. You’ll help push the next frontier of models that reason across vision, language, and 3D, bridging research breakthroughs with scalable engineering.

What You’ll Do

What You Bring

Bonus / Preferred

Equal Employment Opportunity:

We are an equal opportunity employer and do not discriminate on the basis of race, religion, national origin, gender, sexual orientation, age, veteran status, disability or other legally protected statuses.

Apply at Stability AI →

More open roles at Stability AI

Stability AI · 🔄 synced 2h ago
Research Scientist – Controlled 3D Generation
🌐 Remote-only · Senior
Research Scientist at Stability AI focused on 3D generation using flow-matching and diffusion models. Conduct research on controllable 3D content creation, design training pipelines, and develop conditioning techniques for meshes, Gaussians, and NeRFs.
PyTorchJAXCUDAPython
75
AI-core
Stability AI · 🔄 synced 2h ago
Generative AI Inference Engineer
📍 US 🌐 Remote-only · Senior
Generative AI Inference Engineer at Stability AI building multi-modal ML inference systems. Focus on optimization, deployment, and productionization of diffusion models at scale using PyTorch, Triton, and cloud infrastructure.
PyTorchTritonTensorRTKubernetesAWSGCP
71
AI-fluent