[Remote] Senior Site Reliability Engineer, Robotics & Cloud Infrastructure

Work from home Full-time role Hiring

Note: The job is a remote job and is open to candidates in USA. Bedrock Ocean Exploration is building autonomous ocean intelligence that will enable the ocean economy to solve the world’s most pressing challenges in maritime security, infrastructure, energy, and climate. The role of Senior Site Reliability Engineer focuses on ensuring the reliability of systems involved in autonomous underwater vehicle operations and cloud data processing, while promoting automation and operational excellence.

Responsibilities

Own reliability across the full path from vehicle to customer: AUV onboard compute (Jetson-class modules, ROS 2), topside/operator systems, cloud data pipelines, and the platform that delivers data products
Build and extend infrastructure automation- provisioning, configuration management, deployment, and self-recovery- so that routine field operations and pipeline runs require minimal manual intervention
Design and improve observability: metrics, logging, tracing, and alerting that give both robotics and data teams early, actionable signal across vehicle fleets and cloud services
Drive down on-call burden by identifying and eliminating single points of failure, writing runbooks, and automating the manual steps that currently require tribal knowledge
Participate in a shared on-call rotation covering both robotics-side and cloud-side incidents in 12-hour shifts spanning European and East Coast business hours; lead and contribute to blameless post-incident reviews
Define and track reliability targets, availability, data yield, recovery time, tied to continuous-operations goals, and partner with robotics and data teams to meet them
Manage cloud infrastructure on AWS (compute, storage, networking, IaC, cost, and security posture) for data processing and platform workloads
Improve fleet- and vehicle-level configuration management, deployment safety, and rollback so changes reach the field reliably and predictably

Skills

5+ years in an SRE, DevOps, or infrastructure engineering role running production systems with real uptime and on-call responsibilities, including senior-level ownership of reliability outcomes
Experience implementing a scalable incident management and operational excellence mechanism that treats operators as customers, building processes and tooling that serve the people running operations day to day, not just the engineering team
Strong automation instincts: comfortable scripting and building tooling in Python and/or Go and Bash, and using infrastructure-as-code (Terraform or equivalent)
Hands-on AWS experience across compute, storage, networking, and IAM, plus containerization and orchestration (Docker, Kubernetes or similar)
Working knowledge of Linux internals, networking, and observability tooling (Prometheus/Grafana or equivalents)
Comfort operating across environments that aren't just cloud: embedded or edge compute, intermittent connectivity, and physical systems that fail in messy ways
A reliability mindset: you instrument before you guess, you automate the second time you do something manually, and you write things down so the next person or the system can handle it without you
Strong ownership and communication in a small, fast-moving team
East Coast location is required to support coverage across both European operations and the East Coast during 12-hour on-call shifts
Travel to field deployments and Richmond HQ is expected (approximately 5–15%)
Candidates must have legal authorization to work in the United States without visa sponsorship
Due to the nature of our government and defense work, candidates must be eligible to obtain a U.S. Secret security clearance if requested
Experience with robotics or embedded systems: ROS / ROS 2, Jetson or similar edge compute, sensor integration
Background supporting field operations, autonomous systems, or hardware-in-the-loop environments
Familiarity with data pipelines and geospatial or large-binary data formats
Experience standing up on-call practices and incident response from an early stage
Some connection to the ocean: professional, academic, or personal. You're excited to be around people who dive, sail, build, and explore offshore
Active U.S. Secret security clearance or above

Company Overview

Bedrock Ocean Exploration is a platform for underwater vehicles. It was founded in 2020, and is headquartered in Brooklyn, New York, USA, with a workforce of 11-50 employees. Its website is https://bedrockocean.com.

Apply Now

[Remote] Senior Site Reliability Engineer, Robotics & Cloud Infrastructure

More open positions

[Remote] Industry Education & Training Senior Specialist

[Remote] Principal Software Engineer (Ai)

[Remote] Clinical Research Associate II - Oncology WI- Remote

[Remote] Project Manager, Large Scale Projects

[Remote] Clinical Research Associate I - Oncology- WA-Remote

[Remote] Network Engineer (Arista)

[Remote] Key Account Manager- Packaging & Labeling

Quality Control Inspector (SL) - Haining

Senior Academic Advisor (Remote) in Culver City, CA

Project Administrator

[Remote] Account Executive

Evening Gown & Cocktail Dress Seamstress – Alterations – Leesburg, VA

Remote Data Entry Clerk – Precision Data Management & Quality Assurance at careerzynith (Flexible Schedule)

Technical Lead, Field Services

Remote Data Entry Consultant – Global Equity & Stock Plan Administration (Entry Level) – $25/hr – Join careerzynith

AI Solutions Architect

RN Nurse Navigator – Chronic Care Specialty Team – Full Time – Remote, PA

[Remote] Mechanical Field Service Technician Job Details | our company

AI Integration & Data Engineer (Vibecoding) - Master-Level Internship

[Remote] Risk Consultant

Field Activation Sales Manager