[Remote] Research Engineer - AI Systems

Work from home Full-time role Hiring

Note: This is a remote position open to candidates in the USA. Yotta Labs is building the next-generation multi-silicon AI cloud and runtime platform to power the world’s most demanding AI workloads. They are seeking a highly motivated AI Systems Research Engineer specializing in Trainium and GPU kernels to optimize AI applications and improve performance on their platform.

Responsibilities

  • Design and implement high-performance kernels for Attention, MoE, GEMM, collective communication, and quantization
  • Optimize kernels for NVIDIA, AMD, and AWS Trainium
  • Develop custom operators and graph optimizations using the Neuron SDK, PyTorch/XLA, TorchDynamo, and the Neuron Compiler
  • Improve performance of vLLM, SGLang, TensorRT-LLM, and custom inference runtimes
  • Design scalable distributed training and inference solutions across thousands of accelerators
  • Contribute to open-source projects, publish technical findings, and engage with the developer community

Skills

  • Proficiency in AI programming languages such as Python and C++
  • Deep understanding of GPU architecture and performance optimization
  • Experience with CUDA, Triton, ROCm/HIP, or AWS Neuron
  • Strong understanding of AI frameworks (e.g., PyTorch, Dynamo, LMCache), model architectures, and profiling tools (e.g., Nsight, ROCm Profiler, or Neuron Profiler)
  • Strong problem-solving skills and the ability to work in a collaborative, remote environment
  • A background in computer science, engineering, or a related field is preferred
  • Contributions to open-source AI infra projects like vLLM, SGLang, PyTorch, or Triton
  • Experience with FlashAttention, PagedAttention, MoE, RLHF, or distributed AI systems
  • Publications in top-tier conferences like MLSys, OSDI, SOSP, NSDI, SC, HPCA, or ISCA

Benefits

  • Competitive compensation with equity
  • Flexible, remote work environment that values innovation and autonomy

Company Overview

  • Yotta Labs is building the GPU cloud for efficient ML with heterogeneous hardware and cross-cloud orchestration. It was founded in 2024 and is headquartered in Seattle, Washington, USA, with a workforce of 2-10 employees. Its website is https://yottalabs.ai.

Company H-1B Sponsorship

  • Yotta Labs has a track record of offering H-1B sponsorships, with 2 in 2025. Please note that this does not guarantee sponsorship for this specific role.