[Remote] Site Reliability Engineer
Note: This is a remote position open to candidates in the USA. General Dynamics Information Technology is a global technology and professional services company that delivers consulting, technology, and mission services to U.S. government agencies. They are seeking a Site Reliability Engineer to ensure the resilience and performance of mission-critical Defense systems by blending software engineering, automation, and operations expertise.
Responsibilities
- Design, build, and maintain highly available, scalable systems across cloud and on‑prem environments
- Develop automation solutions that improve observability, speed recovery, and eliminate manual operational work
- Implement monitoring, alerting, and performance tuning strategies that ensure system health
- Collaborate with development and infrastructure teams to design reliable architectures and CI/CD pipelines
- Conduct root cause analysis and drive systemic improvements to prevent future incidents
- Champion SRE best practices such as SLIs/SLOs, error budgets, and automated incident response
- Provide input to proposal operations in your area of subject matter expertise, collaborating on solution elements and writing narratives that describe the technical solution designed for a specific opportunity
Skills
- 15+ years of related experience
- Bachelor's degree with 15 years of experience, or an additional 4 years of work experience in lieu of a degree
- Strong scripting and automation skills (Python, Bash, PowerShell, etc.)
- Hands-on experience with monitoring tools (Prometheus, Grafana, Splunk, ELK, Datadog, etc.)
- Familiarity with Kubernetes, container orchestration, and modern CI/CD pipelines
- Understanding of networking, Linux system internals, and distributed systems
- Ability to troubleshoot complex technical issues across the stack
- US Citizenship Required
- Candidate must possess an active Secret clearance to start and have the ability to obtain Top Secret/SCI
- Experience supporting DoW or other federal programs
- Certifications such as Kubernetes (CKA/CKAD), AWS/Azure, or ITIL
- Experience implementing SRE frameworks at scale
Benefits
- A variety of medical plan options, some with Health Savings Accounts
- Dental plan options
- A vision plan
- A 401(k) plan offering the ability to contribute both pre- and post-tax dollars up to the IRS annual limits and receive a company match
- Full flex work weeks where possible
- A variety of paid time off plans, including vacation, sick and personal time, holidays, paid parental, military, bereavement and jury duty leave
- 15 days of paid leave per calendar year to be used for vacations, personal business, and illness
- An additional 10 paid holidays per year
- Paid leave and paid holidays are prorated based on the employee’s date of hire
- The GDIT Paid Family Leave program provides a total of up to 160 hours of paid leave in a rolling 12-month period for eligible employees
- Short and long-term disability benefits
- Life, accidental death and dismemberment, personal accident, critical illness and business travel and accident insurance
Company Overview