← all jobs

Site Reliability Engineer-3

Work from home Full-time role Hiring

Position Summary We are looking for a highly motivated and high-potential Senior Site Reliability Engineer (SRE) to join our team, lead impactful initiatives, and further elevate your career in reliability engineering. This is a transformative moment to be part of the SRE team at WEX. Our products support a wide range of customer businesses and generate complex, high-volume telemetry and operational data across systems and platforms. As WEX scales, reliability, performance, and operational excellence are more essential than ever. As a Senior SRE, you’ll take ownership of initiatives that improve the reliability, scalability, and efficiency of our platforms and services. You’ll lead the design and implementation of tooling and automation to reduce toil, improve observability, optimize performance, and enhance incident and problem management. You’ll drive root cause analysis and resilience engineering, helping teams shift from reactive to proactive practices. You’ll also act as a key partner to engineering and product teams—guiding them on building with reliability in mind, embedding SRE best practices, and influencing platform architecture and operational maturity. We operate with agile methodologies and a product-minded engineering culture, and we leverage modern technologies—including AI—to continuously evolve our reliability capabilities. You’ll work on complex challenges with high business impact and collaborate with a team of talented engineers and leaders who will support and challenge you to grow further as a technical and strategic leader. If you’re passionate about reliability, eager to lead, and ready to make a big impact, this is a great opportunity for you! Responsibilities: Lead efforts in designing and implementing scalable and reliable systems. Develop advanced automation strategies to reduce manual work. Conduct detailed postmortems and implement permanent fixes. Mentor junior engineers and promote best practices across teams. Improve incident response processes and drive MTTR (Mean Time to Recovery) reductions. Optimize cloud infrastructure costs and resource utilization. Influence SRE culture and process improvements. Required Qualification 5+ years of experience in SRE, DevOps, or software engineering roles. Strong programming skills in Python, Go, or Java. Experience with scalable and distributed systems. Experience with monitoring and logging (Grafana, ELK stack, Splunk, etc.). Knowledge of containerization and orchestration (Docker, Kubernetes). Advanced cloud automation experience (AWS, Azure, GCP). Understanding of CI/CD pipelines and version control systems. Knowledge of networking, databases, and storage architectures. Knowledge of incident management frameworks (e.g., xMatters, PagerDuty, Opsgenie). Experience in managing production reliability for real-time systems, score computation services, or policy engines. Preferred Qualification Experience with chaos engineering and fault tolerance strategies. Strong understanding of performance optimization and benchmarking tools. Experience working with security and compliance requirements (SOC2, ISO27001, PCI DSS). Experience with designing and developing AI based solutions. Ability to manage and collaborate across cross-functional teams. Proven ability to reduce operational toil through automation. Expertise in detecting and triaging issues that impact risk scoring accuracy, model drift, or signal degradation. Experience in designing observability into the entire risk signal chain, from ingestion to decision. Familiarity with risk-focused SLIs/SLOs such as % of risk decisions under 100ms, Time to propagate policy changes, Model evaluation completion times etc.

More open positions

Outside Sales Account Representative (Multi-Family)- Renton/Eastside, WA

Work from home Full-time role

Lead Extend Developer

Work from home Full-time role

Account Executive, Co-Packing and Retail Solutions

Work from home Full-time role

Outside Sales Account Manager (Multi-Family) Mississippi Gulf Coast

Work from home Full-time role

Product Designer (Part-Time)

Work from home Full-time role

Remote Customer Support Specialist – Live Chat, Sales Assistance & Technical Tire Expertise for careerzynith

Work from home Full-time role

Manager - Clinical Research Billing and Finance

Work from home Full-time role

[Remote] Account Manager, Donor Advancement

Work from home Full-time role

Internal Posting for CTF Members Only - UU100 VA1 - Mental Health Literacy(Fall 2026)

Work from home Full-time role

Senior Data Analyst - Customer Experience Excellence - Remote Opportunity at careerzynith

Work from home Full-time role

[Remote] Staff Analytics Engineer, Business Intelligence

Work from home Full-time role

AI Performance Optimization Engineer

Work from home Full-time role

[Remote] Systems Analyst III (Senior MDM Administrator/Developer) - 26-07079

Work from home Full-time role

Remote Recruiter | Military Spouse Preferred

Work from home Full-time role

[Remote] Senior AI and ML HPC Cluster Engineer

Work from home Full-time role

Recruiting Specialist

Work from home Full-time role

Remote Customer Service Representative – Part‑Time Weekend Shifts, $18/hr – Flexible Home‑Based Role at careerzynith, Supporting Retail, Technology & Healthcare Clients

Work from home Full-time role

Remote Customer Service Representative – Solar Energy Industry Leader | Entry-Level Position at Careerzynith

Work from home Full-time role

QA Engineer

Work from home Full-time role

Advisory Services Technical Account Manager

Work from home Full-time role

SVP & General Manager, YourCause from Blackbaud

Work from home Full-time role