[Remote] Backend Engineer, Multi-cloud Inference Platform
Note: This is a remote position open to candidates in the USA. Modular is on a mission to revolutionize AI infrastructure by rebuilding the AI software stack. They are seeking a Backend Engineer to build a multi-cloud, multi-tenant platform for inference services, focusing on operational excellence and scalable solutions.
Responsibilities
- Build the multi-cloud, multi-tenant platform powering Modular’s inference services
- Build fault-tolerant, low-toil services that can use resources across a variety of hardware platforms and clouds (Tier 1 cloud providers and neoclouds)
- Push the envelope for operational excellence with request-to-kernel observability, multi-cloud deployments, cold-start optimizations, and more
- Build Helm charts, Kubernetes operators, and more to create simple, effective, maintainable deployments
Skills
- 5+ years of experience working in backend engineering
- Experience with service-oriented architecture and distributed systems
- Experience with Cloud Providers (AWS, GCP, Azure, neoclouds)
- Experience with Kubernetes and operating your own services
- A passion for building and operating high-performance, low-toil, observable systems
- Experience in machine learning technologies and use cases
- Creativity and curiosity for solving complex problems, a team-oriented attitude that enables you to work well with others, and alignment with our culture
- Strongly identifies with our core company cultural values
- Practical experience implementing & maintaining security in multi-tenancy environments
- Experience working on high-scale ML inference infrastructure (traditional AI or genAI)
- Familiarity with Go (Golang)
Benefits
- Premier insurance plans
- Up to 5% 401k matching
- Flexible paid time off
- Stock options
- Annual target bonus
- Equity
- Team Building Events
- Regular team onsites and local meetups in Los Altos, CA as well as different cities
- Traveling 2-4 times a year is expected for all roles
Company Overview