[Remote] Site Reliability Engineer (SRE)
Note: The job is a remote job and is open to candidates in USA. Info Way Solutions is seeking a Site Reliability Engineer (SRE) to support the development and operation of Kubernetes-based platforms in regulated environments. The role involves designing and implementing Kubernetes platforms, improving reliability, and collaborating with various teams to ensure operational excellence.
Responsibilities
- Design, implement, and support Kubernetes platforms in FedRAMP High / IL5 environments
- Monitor, troubleshoot, and improve platform reliability, availability, and performance
- Develop automation and operational tooling to reduce manual effort
- Define and maintain SLIs, SLOs, and error budgets
- Support compliance, security audits, and continuous monitoring initiatives
- Build and maintain Infrastructure as Code (Terraform)
- Enhance CI/CD pipelines and deployment automation
- Collaborate with Security, Platform, and Application teams to resolve production issues
- Participate in on-call rotations and support production environments
Skills
- 4-6 years of experience in Site Reliability Engineering, DevOps, or Platform Engineering
- Strong hands-on experience with Kubernetes in production environments
- Experience with cloud platforms such as AWS, Azure, or AWS GovCloud
- Strong Linux administration and networking fundamentals
- Experience with Terraform or other Infrastructure as Code tools
- Programming/Scripting experience using Python or Go
- Experience with monitoring and observability tools such as Prometheus, Grafana, and centralized logging solutions
- Excellent troubleshooting and problem-solving skills
- Bachelor's degree in Computer Science, Information Technology, or a related field (or equivalent experience)
- Strong communication and collaboration skills
- Ability to work independently in a remote, fast-paced engineering environment
- Must be a US Citizen
- Experience supporting FedRAMP High or DoD IL5 environments
- Experience with ArgoCD and CI/CD automation
- Knowledge of container security best practices
- Experience working within regulated or audited environments
Company Overview