[Remote] Dev Ops Engineer
Note: The job is a remote job and is open to candidates in USA. Sage Care is a fast-growing, early-stage healthcare startup transforming healthcare through AI-driven care navigation. The DevOps Engineer will architect and maintain the infrastructure for the platform, focusing on cloud infrastructure, Kubernetes reliability, and CI/CD pipelines.
Responsibilities
- Design, build, and maintain cloud infrastructure in GCP using Terraform
- Architect and manage Kubernetes (GKE) clusters across dev, staging, and production
- Improve networking, IAM, ingress architecture, and environment isolation
- Build reusable infrastructure modules and eliminate configuration drift
- Ensure infrastructure is scalable, cost-efficient, and production-grade
- Design and maintain CI/CD pipelines that enable safe, rapid deployments
- Own and optimize Bazel build configuration, caching, and reproducibility
- Improve build performance and developer velocity within a monorepo environment
- Implement safe release strategies (canary, blue/green, rollbacks)
- Ensure environment parity and reduce deployment-related incidents
- Implement and improve logging, monitoring, and alerting across services and Kubernetes workloads
- Establish SLIs/SLOs and drive reliability improvements
- Reduce MTTR through improved visibility and incident response processes
- Improve production readiness standards and postmortem practices
- Ensure infrastructure supports increasing AI workloads and real-time traffic
- Strengthen IAM, secrets management, and least-privilege access controls
- Harden Kubernetes clusters and cloud infrastructure against misconfiguration
- Partner with security leadership to support HIPAA and SOC2 compliance requirements
- Improve auditability, change tracking, and infrastructure governance
- Embed security best practices directly into CI/CD and build workflows
Skills
- 4–7+ years of DevOps, SRE, or Infrastructure Engineering experience
- Strong hands-on experience with GCP and Kubernetes (GKE preferred)
- Deep experience with Terraform and infrastructure-as-code
- Experience building and maintaining CI/CD pipelines
- Experience working with Bazel or similar build systems in production environments
- Strong understanding of networking, IAM, and cloud security fundamentals
- Experience supporting production systems with uptime requirements
- Experience supporting HIPAA or SOC2 environments
- Experience in early-stage or high-growth startups
- Familiarity with AI/ML infrastructure or real-time systems
- Experience with release engineering best practices in monorepos
Company Overview