← all jobs

Member of Technical Staff, Inference

Work from home Full-time role Hiring

Inferact's mission is to grow vLLM as the world's AI inference engine and accelerate AI progress by making inference cheaper and faster. Founded by the creators and core maintainers of vLLM, we sit at the intersection of models and hardware—a position that took years to build.

About the Role

We're looking for an inference runtime engineer to push the boundaries of what's possible in LLM and diffusion model serving. Models grow larger. Architectures shift: mixture-of-experts, multimodal, agentic. Every breakthrough demands innovations on the inference engine itself. You'll work at the core of vLLM, optimizing how models execute across diverse hardware and architectures. Your work will directly impact how the world runs AI inference. Skills and Qualifications Minimum qualifications: Bachelor's degree or equivalent experience in computer science, engineering, or similar. Deep understanding of transformer architectures and their variants. Strong programming skills in Python with experience in PyTorch internals. Experience with LLM inference systems (vLLM, TensorRT-LLM, SGLang, TGI). Ability to read and implement model architectures and inference techniques from research papers. Demonstrate the ability to contribute performant and maintainable code and debug in complex ML codebases. Preferred qualifications: Deep understanding of KV-cache memory management, prefix caching, and hybrid model serving. Familiarity with RL frameworks and algorithms for LLMs. Experience with multimodal inference (audio/image/video/text). Contributions to open-source ML or system infrastructure projects. Bonus points if you have: Implemented core features in vLLM or other inference engine projects. Contributed to vLLM integrations (verl, OpenRLHF, Unsloth, LlamaFactory, etc). Written widely-shared technical blogs or side projects on vLLM or LLM inference. Logistics Location: This role is based in San Francisco, California. Will consider remote in the US for exceptional candidates. Compensation: Depending on background, skills, and experience, the expected annual salary range for this position is $200,000 - $400,000 USD + equity. Visa sponsorship: We sponsor visas on a case-by-case basis. Benefits: Inferact offers generous health, dental, and vision benefits as well as 401(k) company match.

More open positions

Product Manager, Canvas - US Remote

Work from home Full-time role

Project Coordinator HEALTH PROGRAM (New Mexico REMOTE)

Work from home Full-time role

Scientific Writer/Editor - ON-CALL

Work from home Full-time role

Senior Account Executive, AI Infrastructure Sales

Work from home Full-time role

Senior Account Executive

Work from home Full-time role

Registered Nurse (RN), House Supervisor, Weekend Option

Work from home Full-time role

Senior iOS Developer (Remote - 6 months contract)

Work from home Full-time role

Remote Logistics Coordinator – 3rd Shift Night Shift Opportunity with Manpower in Melbourne, FL

Work from home Full-time role

Virtual Care Technician (Remote)

Work from home Full-time role

[Remote] Director, Performance Marketing

Work from home Full-time role

RN CLINICAL DOCUMENTATION SPECIALIST - ENTERPRISE QUALITY CDI/UR/DM

Work from home Full-time role

2D Animator – Toon Boom Harmony Remote

Work from home Full-time role

Customer Engineer - M365, Copilot & SharePoint: Unlocking Exceptional Customer Experiences

Work from home Full-time role

Real Estate Associate Agent (1099) - Georgia (Future Openings)

Work from home Full-time role

Senior Technical Writer Documentation and education · Multiple locations · Fully Remote

Work from home Full-time role

Senior Business Immigration Paralegal (Perm Cases)

Work from home Full-time role

Network Administrator/ Network Engineer - Remote + W2 Only

Work from home Full-time role

Director, Quality Assurance - Vendor Management job at Vaxcyte in San Carlos, CA

Work from home Full-time role

[Remote] Sr. Tableau / Alteryx Administrator

Work from home Full-time role

Reports Administrator

Work from home Full-time role

Part-Time Remote careerzynith Customer Service Representative – Flexible Hours, Immediate Start, Home‑Based Support Role

Work from home Full-time role