
[Remote] Senior Deep Learning Software Engineer, TensorRT Performance

Work from home Full-time role Hiring

Note: This is a remote position open to candidates in the USA. NVIDIA is seeking an experienced Deep Learning Engineer who is passionate about analyzing and improving the performance of NVIDIA’s inference ecosystem. The role involves collaborating with teams to develop and optimize deep learning inference software, with a focus on performance benchmarking and innovative solutions across a range of applications.

Responsibilities

  • Establish groundbreaking performance benchmarking methodologies and analysis workflows and identify performance issues and opportunities for NVIDIA’s inference ecosystem (e.g. TensorRT/TensorRT-EdgeLLM/Torch-TensorRT)
  • Contribute features and code to NVIDIA/OSS inference frameworks including but not limited to TensorRT/TensorRT-EdgeLLM/Torch-TensorRT
  • Develop new model pipelines for NVIDIA’s inference ecosystem with optimized performance including but not limited to areas like quantization, scheduling, memory management, and distributed inference to set the gold standard for Gen AI performance
  • Collaborate with teams inside and outside of NVIDIA across generative AI, automotive, robotics, image understanding, and speech understanding to set direction and develop innovative inference solutions
  • Scale performance of deep learning models across different architectures and types of NVIDIA accelerators

Skills

  • Bachelor's, Master's, PhD, or equivalent experience in a relevant field (Computer Science, Computer Engineering, EECS, AI)
  • At least 3 years of relevant software development experience
  • Strong C++ and Python programming and software engineering skills
  • Experience with DL frameworks (e.g. PyTorch, JAX, TensorFlow, ONNX) and inference libraries (e.g. TensorRT, TensorRT-LLM, vLLM, SGLang, FlashInfer)
  • Experience with performance analysis and performance optimization
  • Strong foundation and architectural knowledge of GPUs
  • Deep understanding of modern deep learning models and workloads (e.g. Transformers, Recommenders, ASR, TTS, Visual Understanding)
  • Proficiency in at least one deep learning domain-specific programming language (e.g. CUDA/TileIR/CuTeDSL/cutlass/Triton)
  • Prior contributions to major LLM inference frameworks (e.g. vLLM) or prior experience with graph compilers in deep learning inference (e.g. TorchDynamo/TorchInductor)
  • Prior experience optimizing performance for low-latency, resource-constrained systems or embedded AI pipelines (e.g. Jetson systems or other edge AI accelerators)

Benefits

  • Equity
  • Benefits

Company Overview

  • NVIDIA is a computing platform company operating at the intersection of graphics, HPC, and AI. It was founded in 1993 and is headquartered in Santa Clara, California, USA, with a workforce of 10001+ employees. Its website is https://www.nvidia.com.

Company H1B Sponsorship

  • NVIDIA has a track record of offering H1B sponsorships, with 448 in 2026, 1872 in 2025, 1354 in 2024, 976 in 2023, 835 in 2022, 601 in 2021, and 529 in 2020. Please note that this does not guarantee sponsorship for this specific role.