← all jobs

[Remote] Senior Site Reliability Engineer

Work from home Full-time role Hiring

Note: The job is a remote job and is open to candidates in USA. ARA is a company focused on Information Technology, and they are seeking a Senior Site Reliability Engineer. The role involves partnering with development and IT teams to enhance system operability and support, while also maintaining operational standards and improving platform stability.

Responsibilities

  • Partner with software developers, platform engineers, and IT staff to improve system design, operability, deployment safety, and production support readiness
  • Define and maintain operational standards, runbooks, support procedures, escalation paths, and service-level objectives
  • Evaluate system architecture and changes to ensure they balance functional requirements, service quality, reliability, security, and compliance needs
  • Drive continuous improvement in platform stability, maintenance, and availability
  • Provide advanced technical support and troubleshooting for complex platform and service issues affecting internal users and stakeholders

Skills

  • 8+ years of experience in Site Reliability Engineering, DevOps, Platform Engineering, Systems Engineering, or related infrastructure roles supporting production services
  • Strong experience with Linux systems administration and troubleshooting in enterprise environments
  • Strong experience operating and maintaining on-prem Kubernetes platforms and all related components including CRI, CNI, and CSI plugins
  • Experience deploying and maintaining applications on Kubernetes using Helm, Kustomize, and similar tooling
  • Experience supporting DevOps tooling such as GitLab, Artifactory, Jira, Confluence
  • Experience with GitOps tools such as FluxCD or ArgoCD
  • Proficiency scripting with at least one of Python, Go, or Bash
  • Strong experience designing, maintaining, and maturing observability tooling including monitoring, dashboards, logging and tracing, and supporting SLOs
  • Strong understanding of reliability engineering concepts: Service health indicators, High availability design, failure reduction, and testing, Operational readiness practices, including developing documentation, runbooks, and architectural descriptions, Incident response, root cause analysis, remediation/recovery
  • Ability to obtain a security clearance, which includes U.S. citizenship
  • Bachelor's degree in CS, Software Engineering or other IT-related field or equivalent experience
  • Experience with multiple Linux distributions including Ubuntu
  • Experience with at least one of the following: Tanzu Kubernetes, Nutanix Kubernetes Platform, Canonical Kubernetes
  • Experience with cloud platforms such as AWS and Azure
  • Experience with infrastructure automation and configuration management
  • Experience managing AI tooling on Kubernetes including MCP Servers, LLM platforms (vLLM, Ollama), Kubeflow
  • Experience with security and compliance considerations in regulated environments
  • DoD experience
  • Active or inactive Secret Security Clearance

Company Overview

  • ARA provides research, engineering, and technical support services. It was founded in 1979, and is headquartered in Albuquerque, New Mexico, USA, with a workforce of 1001-5000 employees. Its website is https://www.ara.com.
  • More open positions

    [Remote] Account Sales Manager - Pennsylvania

    Work from home Full-time role

    [Remote] Senior Developer Advocate Engineer - Robotics and Physical AI

    Work from home Full-time role

    [Remote] Lead Web Platform Engineer

    Work from home Full-time role

    [Remote] Cloud Operations Engineer Job Details | Capgemini

    Work from home Full-time role

    [Remote] Senior Business Analyst

    Work from home Full-time role

    Remote Overnight Chat Coordinator – Night‑Shift Customer Support Specialist – $25‑$35/hr – No Experience Required

    Work from home Full-time role

    Service Now and SAP QA

    Work from home Full-time role

    Senior Financial Accounting Analyst – Corporate Controllership & Intercompany Operations

    Work from home Full-time role

    [Remote] Senior Clinical Program Manager

    Work from home Full-time role

    Dynamic Help Desk Support & Customer Service Specialist – Technical Assistance, Issue Resolution, and Client Success at careerzynith

    Work from home Full-time role

    [Remote] Program Management Director (Water/Wastewater Infrastructure)

    Work from home Full-time role

    EdTech Co-Founder / CEO (100 % remote) (m/f/d)

    Work from home Full-time role

    Experienced Virtual Customer Care Representative – Remote Work Opportunity with careerzynith

    Work from home Full-time role

    Surface Transportation Manager

    Work from home Full-time role

    Adjunct Global Campus Instructor - College Writing (Remote)

    Work from home Full-time role

    Business Intelligence Analyst Senior

    Work from home Full-time role

    Experienced Part-time Data Entry Executive – Remote Database Management

    Work from home Full-time role

    Client Success & Operations Manager

    Work from home Full-time role

    Bilingual Remote Customer Service Advocate – Financial Services & Retirement Solutions, Full Training, Career Growth Opportunities at careerzynith

    Work from home Full-time role

    Part‑Time Help‑to‑Claim Telephone & Webchat Adviser – Remote (UK) – 24 hrs per week, Fixed‑Term to March 2025

    Work from home Full-time role

    Fully Remote - License Master Social Worker (LMSW) in Phoenix, AZ

    Work from home Full-time role