Cloud Dev AI Ops Engineer (6012)
About the position We are looking for a skilled Cloud DevAIOps Engineer to design, build, and operate our cloud infrastructure and delivery pipelines. You will focus on designing high-quality, reliable, and scalable solutions across our multiple data center environments, drive automation and integrations, and support GenAI strategic transformations.
Responsibilities
- Design, provision, and manage scalable cloud infrastructure on AWS / Azure / GCP.
- Build and maintain CI/CD pipelines to automate build, test, and deployment workflows.
- Deploy and manage containerized workloads using Docker and Kubernetes (EKS / AKS / GKE / Cloud Run).
- Implement and maintain monitoring, alerting, and observability solutions (Prometheus, Grafana, Datadog).
- Enforce cloud security best practices including IAM, secrets management, and compliance controls.
- Optimize cloud costs through resource right-sizing, autoscaling, and tagging strategies.
- Collaborate with developers to improve developer experience and platform reliability.
- Maintain infrastructure as code, runbooks, and architectural documentation.
- Support incident response, root cause analysis, and on-call rotations.
Requirements
- 3–5+ years in a Cloud, DevOps, or Infrastructure engineering role building scalable distributed systems and event-driven solutions.
- Hands-on experience supporting production workloads on GCP, AWS, or Azure.
- Linux administration and multi-cloud networking fundamentals (DNS, VPCs, load balancing).
- Strong Docker and Kubernetes experience in production environments.
- Proficiency with Terragrunt / Terraform, or equivalent IaC tooling.
- CI/CD pipeline experience (GitHub Actions, GitLab CI, or similar).
- IAM, OAuth & AuthZ deployments (Ory Hydra, Cerbos, or equivalent).
- Scripting skills in Python and/or Bash.
- Bachelor’s degree in Computer Science, Information Systems, or a related field — or equivalent technical experience.
Nice-to-haves
- Cloud certification (GCP Cloud Engineer, AWS Solutions Architect, AZ-305, CKA, or equivalent).
- Expertise building Kubernetes infrastructure and event-driven solutions (Kafka, AWS SQS, GCP Pub/Sub or Tasks).
- Exposure to GenAI trends: Claude, Gemini, Windsurf, Google ADK, and MCPs is an advantage.
- Exposure to SAP ERP or SAP TM on HANA infrastructure is a plus to support transitions.
- GitOps experience (GitHub Actions, ArgoCD, Flux).
- DevSecOps tooling (Aikido, Checkov, Snyk, Trivy).
- FinOps / cloud cost management experience.
- Experience with databases: Postgres, ClickHouse.
- On-premises hardware management experience (Proxmox).
Benefits
- Medical, Dental, and Vision insurance (employee and family coverage)
- Company-paid basic life insurance
- Short-Term & Long-Term Disability insurance
- Health Savings Account with company contributions
- Flexible Spending Account options
- 401(k) retirement savings plan with 3.5% employer match
- 80 hours of front-loaded Sick Pay
- 80 hours of Vacation Pay annually, with increases based on tenure
- 7 paid holidays per year
- Employee Assistance Program (EAP)