[Remote] Manager, DevOps Engineering
Note: The job is a remote job and is open to candidates in USA. Deepwatch is the leader in managed security services, protecting organizations from ever-increasing cyber threats 24/7/365. The Manager of DevOps Engineering will lead the architecture, automation, and reliability of secure cloud infrastructure while mentoring a high-caliber SRE team.
Responsibilities
- Lead and grow the SRE team, setting direction, mentoring and managing engineers, and fostering excellence
- Design and manage cloud and containerized infrastructure with IaC (Terraform)
- Implement robust CI/CD pipelines integrating security and compliance
- Build scalable observability systems, leading the definition of SLIs / SLOs and dashboards
- Manage incident response, root cause analysis, and postmortems; automate recovery via playbooks/runbooks
- Drive capacity planning, performance tuning, and cost efficiency
- Collaborate with InfoSec, DevSecOps, and Compliance teams—ensuring alignment with frameworks like FedRAMP, NIST, RMF
- Support program-level initiatives, communicating effectively with stakeholders
- Promote a culture of reliability, security, and developer efficiency
- Maintain an active 'player' role, dedicating approximately 75% of your time to hands-on engineering (design, coding, and architecture) and 25% to leadership, mentorship, and management
Skills
- 8+ years in SRE, DevOps, or Platform Engineering; with technical leadership experience ready to step into management as a player/coach
- Proven cloud experience (AWS, GCP) and container orchestration (Kubernetes, Docker)
- Strong coding/scripting (Python, GO) and proficiency in IaC and GitOps
- Deep knowledge of observability tools and defining reliability metrics
- Experienced in incident handling (PagerDuty, Datadog) and post-incident evaluations
- Demonstrated success in mentoring and developing junior/mid-level SRE talent, moving beyond delegation to hands-on technical coaching
- Familiarity with regulatory or cybersecurity frameworks (FedRAMP, NIST, STIGs, RMF)
- Excellent cross-functional communication and stakeholder management
- Certifications such as AWS, CKA, or cyber security credentials (e.g., OSCP)
Benefits
- Medical, dental, vision, and disability insurance
- Flexible Time Off (FTO), 12 company holidays, sick leave and 8-Weeks Paid Parental Leave
- Unique professional development benefits with Annual “development dollars” to support our people growth and development
- Wellness contests and monthly educational programs
- 401(K) retirement program
Company Overview