[Remote] Staff Cloud Operations Engineer
Note: The job is a remote job and is open to candidates in USA. Branch is on a mission to empower workers with financial freedom by providing accessible financial services. They are seeking a Staff Cloud Operations Engineer to join their team, where the role involves coordinating with various departments, maintaining cloud infrastructure, and ensuring compliance with security standards.
Responsibilities
- Effectively coordinate with both technical and non-technical staff across departments; this role involves a good amount of cross-functional collaboration, including partnering on workflow automation (crons, n8n, Airflow) that bridges infrastructure and business processes
- Comfortable with the process side of an Ops team: running outage incidents, working within compliance requirements, coordinating sprints, collecting metrics to report up to management, and maintaining documentation
- Maintain and improve incident response process and tooling, ensuring accurate, reliable, and proactive monitoring across the application stack
- Ensure that our cloud infrastructure is compliant with relevant security and regulatory standards
- Design, implement, and maintain our cloud infrastructure in GCP
- Ensure our cloud infrastructure is scalable, secure, and highly available
- Build automation that reduces toil and enables infrastructure to self-heal during production incidents
- Troubleshoot issues and provide support for our cloud infrastructure and observability tools
Skills
- Bachelor's degree in Computer Science, Engineering, or a related field, or equivalent experience
- Professional, expert level experience in cloud infrastructure engineering (preferably GCP) with minimum 5 years experience
- Experience with observability tools and processes, such as monitoring, logging, and tracing
- Strong experience in at least one programming/scripting language such as Python, Bash, Java, or Go
- Experience with containerization technologies such as Docker and Kubernetes
- Experience collaborating with network and application security teams; ability to make sound security-informed judgments is a plus
- Comfortable leveraging AI tools to accelerate work. The company provides access to Claude, and integrates Copilot and Gemini into automation workflows; we expect engineers to use these effectively and look for opportunities to apply them
- Ability to work in a 24/7 on call rotation
- Familiarity with CI/CD pipelines and Git
- Collaboration experience with Data Engineering teams
- Experience with automation and configuration management tools such as Terraform, Ansible, or Chef
- Previous experience in a 24/7 on call environment
- Mentorship or technical leadership experience; comfortable directing the work of junior engineers on a project basis, though this is not a day-to-day people management role
Benefits
- Market-leading medical, dental, and vision insurance
- Stock options
- Free Premium-Tier Origin Financial Wellness subscription
- Monthly home-office stipend
- 401k (TransAmerica)
- 12-weeks paid parental leave for birthing and non-birthing parents
- Flexible time off + sick and safe time
- 11 paid company holidays
- Branch@Branch Same Day Pay Option
Company Overview