[Remote] Senior Software Engineer, Infrastructure and Tooling Lead - Automation
Note: The job is a remote job and is open to candidates in USA. NVIDIA is looking for a senior technical lead to drive infrastructure and tooling development for their Automation team. This role will focus on building scalable internal platforms, automation frameworks, developer productivity tools, and LLM-powered workflows that improve engineering efficiency across complex software development and validation environments.
Responsibilities
- Lead the design and development of infrastructure, automation frameworks, and internal engineering tools
- Build scalable services, APIs, dashboards, workflow engines, and integrations that improve developer efficiency and operational visibility
- Develop LLM-based workflows for triage, summarization, code and log analysis, test workflow assistance, report generation, and knowledge retrieval
- Integrate tooling with CI/CD systems, source control, issue tracking, test infrastructure, dashboards, and internal engineering services
- Define architecture, coding standards, evaluation methods, and reliability practices for automation and LLM-enabled systems
- Mentor engineers, review designs, and provide technical leadership across infrastructure and tooling projects
Skills
- BS or MS in Computer Science, Computer Engineering, Electrical Engineering, or equivalent experience
- 8+ years of software engineering experience, with strong hands-on development skills
- Proven experience building infrastructure, automation systems, developer tools, workflow platforms, or internal engineering services
- Strong programming experience in Python, Bash, C, and C++, with experience building infrastructure, automation, and systems-level tooling in Linux-based environments
- Experience designing systems that integrate with CI/CD pipelines, source control systems, issue trackers, databases, APIs, and distributed services
- Hands-on experience developing LLM-based workflows, agents, RAG systems, timely pipelines, or AI-assisted automation tools
- Practical understanding of LLM workflow reliability, including evaluation, guardrails, error handling, observability, and human-in-the-loop review
- Strong technical leadership, architecture ownership, mentoring, and cross-team collaboration skills
- Experience building engineering efficiency platforms or automation infrastructure for large-scale software organizations
- Experience with test automation, validation infrastructure, build systems, release workflows, or developer experience tooling
- Familiarity with embeddings, vector search, RAG, model evaluation, agent orchestration, or LLM workflow frameworks
- Strong background in Linux, containers, Kubernetes, cloud or on-prem infrastructure, and distributed systems
- Prior experience leading a small technical team or serving as a technical lead for multi-functional infrastructure projects
Benefits
- Equity
- Benefits
Company Overview
Company H1B Sponsorship