Senior Infrastructure Engineer
About the Role: We're looking for a Senior Infrastructure Engineer to build and run Somnia's key backend services: the L1 and node fleet, RPC and indexing layers, product backends, and developer-facing services teams depend on. An SRE-minded role: you make reliability measurable, rollouts safe, and infrastructure repeatable, so every team can move fast without breaking things. You treat infrastructure as a product: automate relentlessly, measure everything, and leverage AI to accelerate development, operations, and incident response. Key Responsibilities: Define and maintain SLOs, SLIs, and error budgets, plus the observability—metrics, logs, traces and alerts—that catches regressions before users do. Build repeatable, self-service infrastructure through infrastructure-as-code, CI/CD and golden paths so teams can provision, deploy and recover without reinventing the wheel. Own rollouts end-to-end—progressive delivery, canaries, safe migrations and clean rollbacks. Operate the systems behind Somnia's nodes, validators, RPC and indexing, tuning for performance and cost across regions. Lead incident response and on-call, run blameless postmortems, and continuously harden the platform. Partner with product and protocol teams to design and operate production-ready services. You'll rotate between embedding with engineering teams and building the shared platform, tooling and operational standards that underpin the wider organisation. Requirements: Must Have Strong experience operating production infrastructure at scale (cloud and/or bare metal), with deep Linux fundamentals. Experience with infrastructure-as-code such as Terraform or Pulumi, alongside configuration management. Experience running containers and orchestration platforms (Docker, Kubernetes) in production. Strong programming skills, ideally in Go and/or TypeScript, for building automation and internal tooling. Experience with observability stacks (Prometheus, Grafana, OpenTelemetry or equivalents). Experience operating and monitoring distributed systems, including capacity planning and performance tuning. Comfortable operating in high-stakes production environments and responding to incidents. Genuine interest in crypto and on-chain systems.
Nice to Have
Experience operating blockchain node infrastructure (validators, RPC, archive nodes) for an L1/L2. Experience with high-performance networking, low-latency systems or load balancing at scale. Multi-region and geo-distributed deployments with failover strategies. Security and key management (HSMs, secrets management, hardening). EVM tooling and the wider Web3 infrastructure ecosystem. What Success Looks Like Engineers can deploy safely and frequently with confidence. Platform reliability is measurable, with well-defined SLOs and continuously improving service health. Infrastructure is automated, repeatable and increasingly self-service. Incidents become less frequent, easier to diagnose and faster to resolve. Product teams spend more time shipping features and less time managing infrastructure. Why Join Somnia? Design The Future: Join Somnia to work remotely with a global team, earn competitive compensation with token incentives, and help build the future of Web3 at a company where your impact truly matters. First of its Kind: Make Somnia famous as the only hyperspeed L1 with native AI inference. High Stakes: Influence a brand targeting a $1M+ launch budget in the 2026 peak-fragility market. Incentives: Competitive salary + tokens. Role Location The role is remote, but we are looking for someone based in Europe or Asia (preferably).