← All jobs · Tenstorrent

Sr Engineer, Server Inference

Tenstorrent ·
65
AI-Agency
B78 U45
📍 Belgrade, RS Senior
PythonDockerLinuxRISC-V
TL;DR

Senior backend engineer at Tenstorrent building inference server software for AI workloads on custom silicon. Focus on API design, deployment optimization, and scaling ML inference performance.

Apply at Tenstorrent →
share:
you'll be redirected to the company's career page

Job description

Tenstorrent is leading the industry on cutting-edge AI technology, revolutionizing performance expectations, ease of use, and cost efficiency. With AI redefining the computing paradigm, solutions must evolve to unify innovations in software models, compilers, platforms, networking, and semiconductors. Our diverse team of technologists have developed a high performance RISC-V CPU from scratch, and share a passion for AI and a deep desire to build the best AI platform possible. We value collaboration, curiosity, and a commitment to solving hard problems. We are growing our team and looking for contributors of all seniorities.

Join our Inference Server Technologies team, where we develop software that powers state-of-the-art AI inferencing on Tenstorrent’s cutting-edge hardware. Our team builds the layer that works on top of the Tenstorrent ML libraries - designing APIs, deploying workloads, and benchmarking end-to-end inference speed. You’ll help us shape how developers consume and scale model execution on Tenstorrent’s stack.

This role is hybrid based in Belgrade, Serbia.

We welcome candidates at various experience levels. During the interview process, candidates will be assessed for the appropriate level, and offers will align with that level, which may differ from the one in this posting.

 

Who You Are

 

What We Need

 

What You Will Learn

 

Tenstorrent offers a highly competitive compensation package and benefits, and we are an equal opportunity employer.

This offer of employment is contingent upon the applicant being eligible to access U.S. export-controlled technology.  Due to U.S. export laws, including those codified in the U.S. Export Administration Regulations (EAR), the Company is required to ensure compliance with these laws when transferring technology to nationals of certain countries (such as EAR Country Groups D:1, E1, and E2).   These requirements apply to persons located in the U.S. and all countries outside the U.S.  As the position offered will have direct and/or indirect access to information, systems, or technologies subject to these laws, the offer may be contingent upon your citizenship/permanent residency status or ability to obtain prior license approval from the U.S. Commerce Department or applicable federal agency.  If employment is not possible due to U.S. export laws, any offer of employment will be rescinded.

Apply at Tenstorrent →

More open roles at Tenstorrent

Tenstorrent · 🔄 synced 2h ago
RISC-V AI / HPC & Agentic Software Engineer
📍 New Taipei City, TW 🌐 Remote-only 🛠 AI tools welcome at work · Lead
RISC-V AI/HPC & Agentic Software Engineering Lead at Tenstorrent. Optimize LLK infrastructure and lead bring-up of RISC-V-native agentic AI software stacks, including runtime orchestration and distributed execution frameworks. Work at the hardware-software boundary with CPU architects and compiler engineers.
RISC-VCC++LLVMGCCHPC
89
AI-core
Tenstorrent · 🔄 synced 2h ago
RISC-V AI / HPC & Agentic Software Engineering Lead
📍 US 🌐 Remote-only 💰 $100K–$500K 🛠 AI tools welcome at work
RISC-V AI/HPC & Agentic Software Engineering Lead at Tenstorrent. Optimize low-level kernel infrastructure and build agentic AI software stacks on custom RISC-V processors, working at the hardware-software boundary.
RISC-VC++LLVMGCCHPCAI software stacks
85
AI-core
Tenstorrent · 🔄 synced 2h ago
Machine Learning Engineer, AI Models
📍 Nicosia, CY
Machine Learning Engineer at Tenstorrent optimizing LLMs and vision models on custom AI accelerators. Focus on porting, tuning, and validating models end-to-end, working across compiler, kernel, and hardware teams.
PyTorchTensorFlowCUDAC++
83
AI-core
Tenstorrent · 🔄 synced 2h ago
RISC-V CPU Microarchitecture / RTL
📍 US 🌐 Remote-only 💰 $100K–$500K 🛠 AI tools welcome at work
RISC-V CPU microarchitecture and RTL design engineer at Tenstorrent. Develop CPU unit specifications, RTL design, and verification for high-performance AI accelerator hardware. Uses AI tools to accelerate design process.
RISC-VVerilogSystemVerilogVHDL
82
AI-core
Tenstorrent · 🔄 synced 2h ago
Software Engineer, Kernel Development and Optimization
📍 Gdańsk, PL 🛠 AI tools welcome at work
Software engineer at Tenstorrent developing performance-critical GPU-style kernels for AI hardware. Focus on matrix multiplication, attention primitives, and data-movement optimization using C++ and low-level systems programming.
C++RISC-VGPU kernelsCUDA
82
AI-core