
Software Engineer, Observability (Full-Stack)

Anyscale
📍 Bengaluru, IN · Mid
Python, Ray, Datadog, Splunk, AWS CloudWatch
TL;DR

Backend software engineer at Anyscale building observability tools for the Ray distributed computing platform. Focus on dashboards, log aggregation, and monitoring features for AI applications.

Apply at Anyscale →

Job description

About Anyscale


At Anyscale, we're on a mission to democratize distributed computing and make it accessible to software developers of all skill levels. We’re commercializing Ray, a popular open-source project that's creating an ecosystem of libraries for scalable machine learning. Companies like OpenAI, Uber, Spotify, Instacart, Cruise, and many more have Ray in their tech stacks to accelerate the progress of AI applications out into the real world.


With Anyscale, we’re building the best place to run Ray, so that any developer or data scientist can scale an ML application from their laptop to the cluster without needing to be a distributed systems expert.


Proud to be backed by Andreessen Horowitz, NEA, and Addition with $250+ million raised to date.


About the role

We are seeking a Backend Software Engineer to join our team focused on building user-facing application features for the Anyscale AI platform. The role involves interacting with users, understanding their requirements, designing and implementing features, and finally maintaining and improving these features over time. The backend of the platform generally deals with implementing the core business logic of these features.


About the team

The Workspace & Observability Team is dedicated to empowering clients to create robust AI applications using our powerful platform built on Ray. We are a collaborative group of experts committed to providing bespoke monitoring tools and integrations that enhance the development lifecycle. In particular, these tools accelerate writing, debugging, deploying, and monitoring AI applications.


Observability in a distributed cluster involves a huge volume of data, and there are many interesting problems to solve around how to ingest, aggregate, format, and ultimately present that data to our users in a digestible way. With Ray and Anyscale, we have the opportunity to provide great tools out of the box for our users. Join us in shaping the future of AI application development!

A snapshot of projects you may work on

We'd love to hear from you if you have

Anyscale Inc. is an Equal Opportunity Employer. Candidates are evaluated without regard to age, race, color, religion, sex, disability, national origin, sexual orientation, veteran status, or any other characteristic protected by federal or state law. 


Anyscale Inc. is an E-Verify company, and you may review the Notice of E-Verify Participation and the Right to Work posters in English and Spanish.

