[Remote] Senior DevOps Engineer
Note: The job is a remote job and is open to candidates in USA. Sword Health is shifting healthcare from human-first to AI-first through its AI Care platform, making world-class healthcare available anytime, anywhere. As a Senior DevOps Engineer, you'll own and evolve the infrastructure that powers this platform, collaborating closely with engineering teams to ensure reliability and scalability.
Responsibilities
- Design, implement, and maintain scalable, resilient infrastructure to support Sword Health’s high-demand applications and services
- Automate and streamline deployment processes, CI/CD pipelines, and routine maintenance tasks to enhance efficiency and reduce downtime
- Monitor and optimize system performance, proactively identifying and resolving issues to ensure high availability and reliability
- Collaborate closely with development, data, and security teams to ensure seamless integration of infrastructure and code changes
- Drive security best practices by implementing and managing access control, network security, and compliance-related policies across the infrastructure
- Lead incident response and troubleshooting for infrastructure-related issues, ensuring rapid and effective resolution to maintain service continuity
- Mentor and guide junior team members, sharing DevOps best practices and fostering a culture of continuous learning and improvement within the team
- Stay up-to-date with industry trends and emerging technologies, bringing innovative solutions to Sword Health’s DevOps processes and toolchains
Skills
- Experience with Linux environments
- Experience with DevOps and GitOps methodologies
- Experience with Kubernetes and Containerized applications (Docker)
- Experience with Infrastructure as Code (Terraform)
- Experience with Monitoring Tools (Google Cloud Monitoring/StackDriver, Grafana, Prometheus/AlertManager, NewRelic)
- Experience with Jenkins
- Experience with CI/CD
- Team player, Solution-oriented, Proactive attitude with “Get Things Done” mindset
- Enthusiast and interested in technologies and innovation
- Fluent in English (written and oral)
- Usage of AI to debug and develop Infrastructure tooling
- Development of AI Agents to automate processes and Infrastructure monitoring and provisioning
- Experience/Knowledge with Kafka
- Experience/Knowledge with Prometheus/AlertManager
- Experience/Knowledge with Grafana
- Experience/Knowledge with Elasticsearch/ Logstash/ Kibana
- Experience/Knowledge with Vault
- Experience/Knowledge with Redis
- Experience/Knowledge with MySQL
- Experience/Knowledge with DNS
- Experience with PHP
- Experience with Javascript
- Experience with GoLang
- Experience provisioning servers and services using AWS
- Experience provisioning servers and services using Azure
- Experience provisioning servers and services using GCP
- Experience/Knowledge with Istio
- Good know-how about Cloud Networking including VPC Management
- Good know-how about Routing
- Good know-how about NAT
- Good know-how about overall troubleshooting using TCPdump analysis
Benefits
- A stimulating, fast-paced environment with lots of room for creativity
- A bright future at a promising high-tech startup company
- Career development and growth, with a competitive salary
- The opportunity to work with a talented team and to add real value to an innovative solution with the potential to change the future of healthcare
- A flexible environment where you can control your hours (remotely) with unlimited vacation
- Access to our health and well-being program (digital therapist sessions)
- Remote or Hybrid work policy
- Comprehensive health, dental and vision insurance*
- Life and AD&D Insurance*
- Financial advisory services*
- Supplemental Insurance Benefits (Accident, Hospital and Critical Illness)*
- Health Savings Account*
- Equity shares*
- Discretionary PTO plan*
- Parental leave*
- 401(k)
- Flexible working hours
- Remote-first company
- Paid company holidays
- Free digital therapist for you and your family
Company Overview