[Remote] Senior Systems Engineer - Network Infrastructure

Work from home, full-time role

Note: This is a remote position open to candidates in the USA. Nscale is building next-generation AI infrastructure from the ground up, aiming to deliver reliable and scalable network clusters for large-scale AI training and inference. The Senior Systems Engineer will lead the deployment of network clusters, ensuring they are validated and production-ready, while also contributing to automation and process improvement.

Responsibilities

  • Execute end-to-end bringup of network nodes and racks from installation to production readiness
  • Validate BIOS/BMC/firmware configurations and network health
  • Perform rack-level integration including power, cabling, and airflow validation
  • Bring up and validate high-speed network fabrics (InfiniBand, RoCE, 100–400G Ethernet)
  • Configure and validate leaf/spine network connectivity
  • Run cluster-wide burn-in and stress testing
  • Validate node-to-node performance (NCCL, RDMA, GPUDirect)
  • Troubleshoot hardware, firmware, and fabric-level issues
  • Contribute to automation for provisioning and cluster validation
  • Improve deployment playbooks and documentation
  • Identify reliability issues early and drive corrective actions
  • Help turn ad hoc deployments into repeatable systems
  • Work closely with networking, systems software, and data center teams
  • Coordinate with hardware vendors to resolve bringup issues
  • Support rapid capacity expansion as we scale

Skills

  • 5–8+ years in infrastructure engineering, hardware deployment, or data center operations
  • Hands-on experience deploying network servers (HGX/DGX or similar platforms)
  • Experience with high-speed networking (InfiniBand, RoCE, Ethernet fabrics)
  • Strong Linux systems knowledge
  • Experience troubleshooting distributed systems performance issues
  • Comfortable working onsite in data center environments as needed
  • Experience in AI/ML infrastructure or HPC environments
  • Familiarity with NCCL, CUDA, RDMA
  • Automation experience (Python, Ansible, Terraform, Bash)
  • Experience in high-density power and cooling environments

Company Overview

  • Nscale builds AI data centers and provides GPU cloud infrastructure that companies use to train, run, and scale large AI models. It was founded in 2024, and is headquartered in London, England, GBR, with a workforce of 201-500 employees. Its website is https://www.nscale.com.