[Remote] Site Reliability Engineer
Note: The job is a remote job and is open to candidates in USA. Ensono is a trusted technology adviser and managed services provider, helping clients navigate continuous change and embrace innovation. They are seeking a Site Reliability Engineer to support a variety of data solutions and enhance their SRE team, with opportunities for career progression as the division grows.
Responsibilities
- Act as a technical escalation point for unresolved data platform issues in the SRE Pod/s
- Monitor, maintain, and troubleshoot databases/data warehouses and related infrastructure
- Collaborate with the data engineering team to ensure efficient data flow and transformation
- Develop and maintain accurate technical documentation in the form of operational runbooks
- Perform standard pre-approved changes within the scope of our client’s Change Management Process (i.e. new users, etc.)
- Use Ensono’s helpdesk and work tracking systems to maintain logs of all support requests and incidents, and improve these processes, both technically and through stakeholder management
- Participate in the process for, and proactively mitigate risks in a Security management process (Vulnerabilities in Code, Infrastructure, Dependencies) aligned to both Ensono’s and our Clients compliance objectives
- Engaging with suppliers and 3rd parties for support, requests and opportunities, managing the relationship our clients get the best value for their service
- Troubleshooting issues and identifying systemic failings indicated by incidents/failures Implementing fixes and features
- Proposing solutions for reducing toil
- Implementing and refining automation for incident and service request resolution
- Providing leadership in the Incident resolution process, including creating and maintaining documentation, and leading Post-mortem analysis and mitigation planning
- Designing and Reinforcing Service Requests and Change Management (both technically and through stakeholder management) processes, and improving existing processes
- Develop and enhance the process for, and Proactively mitigate risks through Security management (Vulnerabilities in Code, Infrastructure, Dependencies)
- Lead discussion for multiple clients in client-facing meetings around the SRE process, identifying areas for increasing SRE footprint and identifying opportunities for small works and consultancy
- Engaging with: Suppliers and 3rd parties for support, requests and opportunities
- Cross-sale and cross-pollination opportunities within the Ensono organisation
Skills
- IAC tooling (Terraform preferably, or ARM/bicep and CloudFront)
- Core CI/CD Tooling (Azure DevOps, GitHub Actions or Gitlab)
- Monitoring Tooling (DataDog, Splunk, NewRelic, Azure Monitor, AWS CloudWatch)
- Demonstrable experience in multiple core technology (Dotnet, Java, AI/Data Engineering, Golang)
- Troubleshooting issues and identifying systemic failings indicated by incidents/failures
- Implementing fixes and features
- Proposing solutions for reducing toil
- Implementing and refining automation for incident and service request resolution
- Providing leadership in the Incident resolution process, including creating and maintaining documentation, and leading Post-mortem analysis and mitigation planning
- Designing and Reinforcing Service Requests and Change Management (both technically and through stakeholder management) processes, and improving existing processes
- Develop and enhance the process for, and Proactively mitigate risks through Security management (Vulnerabilities in Code, Infrastructure, Dependencies)
- Lead discussion for multiple clients in client-facing meetings around the SRE process, identifying areas for increasing SRE footprint and identifying opportunities for small works and consultancy
- Engaging with Suppliers and 3rd parties for support, requests and opportunities
- Cross-sale and cross-pollination opportunities within the Ensono organisation
- Cloud provider (AWS, Azure, GCP) ‘DevOps Engineer'-level certification and CKAD certification highly beneficial, or required during probationary period
Benefits
- Competitive base with uncapped commission
- The ability to work from a range of flexible locations
- Prestigious sales and broader team recognition with Annual Presidents Club
- Starting with 27 days annual leave (plus bank holidays) – accruing to 30
- 1/2 day leave on your birthday
- Sabbatical options at 5 & 10 years' service
- 5 days study leave
- Generous company pension
- Private healthcare for you and your family
- Payroll giving
- Enhanced paternity and maternity leave
- Equity appreciation program incentive plan
- Life and income protection
- Additional perks such as discounted gym memberships, cycle scheme, EAP and more!
Company Overview
Company H1B Sponsorship