[Remote] SRE with FedRAMP and DoD IL5.
Note: This is a remote position open to candidates in the USA. Dice is seeking a Senior Site Reliability Engineer to support the development and operation of its Kubernetes-based platform in regulated environments. The role focuses on ensuring the platform's reliability, observability, and compliance while collaborating with engineers across the stack.
Responsibilities
- Own and operate components of the Kubernetes platform, including deployment, upgrades, and maintenance
- Contribute to the design and implementation of scalable and reliable platform features
- Build and improve automation, tooling, and CI/CD workflows to reduce operational overhead
- Monitor system health, respond to issues, and take part in incident response
- Contribute to defining and tracking SLIs, SLOs, and error budgets
- Support FedRAMP High / IL5 compliance efforts, including system hardening, documentation, and audit readiness
- Collaborate with senior engineers, technical leaders, and cross-functional teams to deliver platform improvements
- Participate in on-call rotations supporting customer requests and paging alerts
- Participate in post-incident reviews and implement follow-up improvements
Skills
- 8+ years of experience in SRE, DevOps, or platform engineering
- Hands-on experience with Kubernetes and containerized workloads in production
- Hands-on experience with cloud platforms (AWS, Azure, or similar; GovCloud experience a plus)
- Strong working knowledge of Linux systems, networking, and distributed systems fundamentals
- Experience with Infrastructure as Code (e.g., Terraform)
- Ability to write and maintain scripts or services (e.g., Python, Go, Bash)
- Experience with monitoring and observability tools (Prometheus, Grafana, logging systems)
- Basic understanding of security and compliance concepts (e.g., NIST 800-53, STIGs, RMF)
- Exposure to FedRAMP High or DoD IL5 environments
- Experience with CI/CD systems and deployment automation (e.g., ArgoCD)
- Familiarity with container security and vulnerability management
- Experience working in regulated or audited environments
Company Overview