[Remote] Lead Machine Learning Engineer - ML Infrastructure
Note: The job is a remote job and is open to candidates in USA. Samsara is the pioneer of the Connected Operations™ Cloud, enabling organizations to harness IoT data for actionable insights. The Lead Machine Learning Engineer will own the architecture and evolution of the ML platform, collaborating across teams to improve safety outcomes in various industries.
Responsibilities
- Set the technical strategy and own end-to-end delivery of Samsara's ML platform (training, experimentation, batch/online inference, edge) — making architectural decisions and being the accountability point across all platform layers for multiple Safety AI product teams
- Drive the design, launch, and iteration of Safety AI features (CV models, EcoDriving insights, LLM-based reporting) — not just enabling others to ship, but co-owning outcomes including safety metrics, reliability, and cost at production scale
- Design and operate scalable online and batch inference systems (Ray, Spark), including deployment patterns, observability, SLOs, and unified training-to-production workflows
- Partner with firmware and edge teams to package, validate, and deploy models to Samsara devices, and build feedback loops from edge to cloud for continuous improvement
- Own reliability, observability, and security for ML systems across cloud and edge, including on-call practices, incident response, and infrastructure hardening
- Own or co-own end-to-end technical delivery for high-priority or high-risk initiatives, from modeling and system design through production rollout
- Be the technical authority for ML infrastructure architecture across Safety AI — setting direction that cross-functional teams (applied ML, firmware, security, data platform) execute against, mentoring senior engineers and applied scientists, and ensuring platform decisions are made at the right level of abstraction with the right trade-offs
- Drive strong developer experience through documentation and best practices, while contributing to and representing Samsara in open source communities (Ray, Spark, RayDP)
- Champion and role model Samsara’s cultural principles: Focus on Customer Success, Build for the Long Term, Adopt a Growth Mindset, Be Inclusive, Win as a Team
Skills
- 10+ years in machine learning engineering, with demonstrated tech lead ownership of at least two major ML platform domains (distributed training, data/research infrastructure, cloud inference, or feature engineering) serving multiple product teams at scale
- Proven record of shipping ML-powered features end-to-end — from design through production and iteration — with measurable impact on product or business metrics (not just building internal tooling)
- Hands-on Ray and Kubernetes expertise in production environments; Spark experience strongly preferred. Able to be a credible peer to the most senior engineers on the team
- Deep understanding of ML fundamentals beyond pipelines: evaluation methodology, dataset design, ablation, drift, and the ability to review and redirect modeling approaches — you bridge research and engineering, not just serve them
- Demonstrated cross-org technical leadership around platform decisions, and influencing roadmap and go/no-go calls based on throughput, latency, and cost trade-offs
- Experience navigating science-engineering tension — knowing when to hold the platform line and when to adapt for research velocity, and communicating that clearly to both sides
- Prior contributions to open source projects (Ray, Spark, RayDP, or Kubernetes)
- Experience with enterprise security/compliance in ML environments
- Background working with edge/on-device ML and firmware/embedded teams
Benefits
- Initial RSU grant with no vesting cliff, and ongoing refresh opportunities tied to performance, subject to plan terms and conditions
- Performance-based bonus/variable pay
- Equity (for eligible roles)
- Flexible, employee-led remote model
- Professional development stipend
- Comprehensive health and parental leave plans
- Flexible working model that caters to the diverse needs of our teams
- Offices are open for those who prefer to work in-person and we also support remote work where it aligns with our operational requirements
- Reasonable accommodations throughout the recruiting process for qualified persons with disabilities
Company Overview