← All jobs · Cerebras Systems

Data Center Commissioning Lead

Cerebras Systems ·
31
AI-Agency
B35 U25
📍 US 🌐 Remote-only 💰 $220K–$260K Lead 10–15+ yrs
TL;DR

Data Center Commissioning Lead at Cerebras Systems overseeing end-to-end commissioning and readiness of AI data center infrastructure across colocation environments. Responsible for testing, validation, and operational handover of mission-critical systems supporting Cerebras' wafer-scale AI chips.

Apply at Cerebras Systems →
share:
you'll be redirected to the company's career page

Job description

Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer-scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of a single device. This approach allows Cerebras to deliver industry-leading training and inference speeds and empowers machine learning users to effortlessly run large-scale ML applications, without the hassle of managing hundreds of GPUs or TPUs.  

Cerebras' current customers include top model labs, global enterprises, and cutting-edge AI-native startups. OpenAI recently announced a multi-year partnership with Cerebras, to deploy 750 megawatts of scale, transforming key workloads with ultra high-speed inference. 

Thanks to the groundbreaking wafer-scale architecture, Cerebras Inference offers the fastest Generative AI inference solution in the world, over 10 times faster than GPU-based hyperscale cloud inference services. This order of magnitude increase in speed is transforming the user experience of AI applications, unlocking real-time iteration and increasing intelligence via additional agentic computation.

The Role

Cerebras is seeking a Commissioning Lead to own the end-to-end commissioning and readiness of AI data center infrastructure across colocation environments. This role is responsible for ensuring all systems are tested, validated, and fully operational prior to handover, with zero tolerance for failures in mission-critical environments. You will operate with high ownership in a fast-paced startup environment, driving commissioning execution across multiple concurrent sites and ensuring rapid, reliable capacity bring-up.

Responsibilities

• Lead commissioning strategy and execution across all colo data center deployments.

• Own full lifecycle commissioning from Level 1–5 testing through integrated systems testing (IST).

• Develop and enforce commissioning plans, scripts, and procedures.

• Coordinate with construction, engineering, vendors, and colo providers to ensure readiness.

• Oversee testing of electrical systems (switchgear, UPS, generators), mechanical systems (cooling), and IT infrastructure.

• Ensure all systems meet design intent, performance requirements, and reliability standards.

• Drive issue identification, resolution, and closure prior to handover.

• Manage commissioning agents, vendors, and third-party testing teams.

• Establish standardized commissioning processes for repeatable deployments.

• Track and report commissioning progress, risks, and readiness to executive leadership.

• Ensure all documentation, test results, and turnover packages are complete and accurate.

• Validate base building readiness from colo providers prior to fit-out energization.

• Coordinate integration between landlord systems and tenant infrastructure.

• Ensure alignment on power availability, redundancy, and cooling capacity.

• Resolve interface issues between colo infrastructure and Cerebras systems.

• Hold providers accountable for performance during testing and energization.

Skills & Qualifications

• 10–15+ years of experience in commissioning of mission-critical facilities.

• Deep expertise in data center electrical and mechanical systems.

• Experience leading Level 1–5 commissioning for large-scale projects.

• Strong understanding of high-density compute environments.

• Experience working in colo environments and coordinating landlord/tenant interfaces.

• Proven ability to manage multiple sites and fast-track deployments.

• Strong troubleshooting and problem-solving skills.

• Ability to operate in a fast-paced, high-growth startup environment.

• Excellent communication and stakeholder management skills.

 

Location: Remote, USA

The base salary range for this position is $220,000 to $260,000 annually.  Actual compensation may include bonus and equity, and will be determined based on factors such as experience, skills, and qualifications.

 

Why Join Cerebras

People who are serious about software make their own hardware. At Cerebras we have built a breakthrough architecture that is unlocking new opportunities for the AI industry. With dozens of model releases and rapid growth, we’ve reached an inflection  point in our business. Members of our team tell us there are five main reasons they joined Cerebras:

  1. Build a breakthrough AI platform beyond the constraints of the GPU.
  2. Publish and open source their cutting-edge AI research.
  3. Work on one of the fastest AI supercomputers in the world.
  4. Enjoy job stability with startup vitality.
  5. Our simple, non-corporate work culture that respects individual beliefs.

Read our blog: Five Reasons to Join Cerebras in 2026.

Apply today and become part of the forefront of groundbreaking advancements in AI!


Cerebras Systems is committed to creating an equal and diverse environment and is proud to be an equal opportunity employer. We celebrate different backgrounds, perspectives, and skills. We believe inclusive teams build better products and companies. We try every day to build a work environment that empowers people to do their best work through continuous learning, growth and support of those around them.


This website or its third-party tools process personal data. For more details, click here to review our CCPA disclosure notice.

Apply at Cerebras Systems →

More open roles at Cerebras Systems

Cerebras Systems · 🔄 synced 5h ago
Principal ML Investigator
📍 Sunnyvale, US 🛠 AI tools welcome at work · Principal
Principal ML Investigator at Cerebras Systems building ML research and advanced development on the company's wafer-scale AI chip platform. Focus on post-training, dataset optimization, LLM pretraining, sparsity, and domain-specific agents.
PyTorchJAXTensorFlowTritondistributed training frameworks
89
AI-core
Cerebras Systems · 🔄 synced 5h ago
Advanced Technology: AI/ML Research Scientist
📍 Sunnyvale, US · Senior
AI/ML Research Scientist at Cerebras Systems designing and developing AI models and training methodologies on wafer-scale hardware. Work spans optimization theory, model architecture, and computational science to explore new algorithmic possibilities enabled by novel hardware architecture.
PythonPyTorchC
87
AI-core
Cerebras Systems · 🔄 synced 5h ago
Applied AI/ML Scientist
📍 AE · Senior
Applied AI/ML Scientist at Cerebras Systems developing and customizing large language models on the company's wafer-scale AI chip. Responsibilities include training custom models, fine-tuning LLMs, designing agentic systems, and serving as technical expert for customer AI initiatives.
PythonPyTorchTransformersLLMsRLHFDPO
87
AI-core
Cerebras Systems · 🔄 synced 5h ago
AI Engineer, Model Quality and Performance
📍 Sunnyvale, US 🛠 AI tools welcome at work · Mid
AI Engineer at Cerebras Systems building model quality and performance systems for wafer-scale AI inference. Design eval suites with AI agents, automate release qualification, and build benchmarking workflows for customer use cases.
ClaudeDockerGitPythonPyTorchJAX
83
AI-core
Cerebras Systems · 🔄 synced 5h ago
LLM Inference Performance & Evals Engineer
📍 Toronto, CA 🛠 AI tools welcome at work · Mid
LLM Inference Performance & Evals Engineer at Cerebras Systems building performance-eval pipelines and benchmarking cutting-edge model optimizations on wafer-scale hardware. Prototype architectural tweaks, develop agent-driven automation for experiments, and collaborate across compiler, runtime, and silicon teams.
PythonTritonLLVMMLIRFlash-AttentionTransformers
83
AI-core