← all jobs

Senior AI Testing Engineer (Generative AI)

Work from home Full-time role Hiring

Senior AI Testing Engineer (Generative AI) Location India Remote / Hybrid / In-office — [specify your actual working model here] Experience 5–8 years total experience in software testing, QA engineering, or SDET roles, with at least 2–3 years of meaningful, hands-on exposure to Generative AI systems, LLM applications, or AI quality engineering. Role Overview We are looking for a Senior AI Testing Engineer to own quality across our Generative AI products and platform. This role is fundamentally about engineering quality into AI systems — not running test scripts. You'll design evaluation frameworks, build automated testing pipelines, and define what "good" looks like for LLM outputs, RAG systems, AI agents, and voice AI applications. You'll work directly with AI engineers and product teams to make sure our systems are reliable, safe, and measurably improving over time. If you understand how LLMs fail, know how to catch hallucinations before users do, and want to build the quality infrastructure that underpins production AI at scale — this is the role.

Key Responsibilities

Evaluation Strategy & Frameworks · Design and own comprehensive testing strategies for Generative AI products — including LLM applications, RAG pipelines, AI agents, voice AI systems, and workflow automation · Define evaluation methodologies covering functional testing, response quality, hallucination detection, safety and guardrail testing, prompt injection, bias and toxicity, retrieval quality, latency benchmarking, and agent workflow validation · Build reusable AI testing frameworks and automation pipelines for continuous evaluation · Create datasets, benchmark suites, and golden test sets for GenAI evaluation Automated Evaluation · Develop automated evaluation pipelines using LLM-as-a-Judge and hybrid evaluation methods · Implement CI/CD-integrated AI evaluation pipelines · Drive observability and monitoring strategies for production AI systems Quality Standards & Collaboration · Define measurable quality KPIs for AI systems · Establish testing standards, best practices, and governance processes for GenAI applications · Work closely with AI engineers, product, and platform teams to embed quality throughout the development lifecycle Required Skills & Experience Testing & Engineering Experience · 5–8 years in software testing, QA engineering, SDET, or test automation · 2–3 years of hands-on experience testing or evaluating production-grade Generative AI or LLM-based systems · Strong test automation skills in Python · Experience designing scalable automated testing frameworks · Familiarity with API testing, integration testing, and performance testing Generative AI Knowledge · Solid understanding of how LLM systems work — and how they fail · Experience with RAG architectures, prompt engineering, AI agents, embedding models, and vector databases · Understanding of LLM evaluation methodologies and AI system failure modes GenAI Testing Frameworks · Hands-on experience with at least one or more GenAI evaluation frameworks, such as: DeepEval, Ragas, LangSmith, Promptfoo, TruLens, OpenAI Evals, or LangChain evaluation tools Quality Engineering · Expertise in test strategy, test planning, test automation architecture, defect lifecycle management, and quality metrics · Ability to define and track measurable quality KPIs for AI systems Preferred Qualifications · Experience with cloud platforms (AWS, Azure, or GCP) · Familiarity with MLOps / LLMOps workflows · Experience with CI/CD pipelines and DevOps practices · Exposure to monitoring and observability tooling for AI systems · Understanding of security and compliance for GenAI products · Experience with conversational AI or voice AI systems

More open positions

Senior Data Analyst – AI & Automation

Work from home Full-time role

Cyber Defense Analyst

Work from home Full-time role

CAS Prin EP Mapping Specialist - Columbus

Work from home Full-time role

Territory Manager Cardiac Ablation Solutions

Work from home Full-time role

Sales Development Manager

Work from home Full-time role

Experienced Customer Service Representative – Delivering Exceptional Experiences in a Remote New York State Setting

Work from home Full-time role

Principal Systems Engineer

Work from home Full-time role

Regional MDS Coordinator - Registered Nurse

Work from home Full-time role

Senior Customer Experience Operations Manager

Work from home Full-time role

Full Spectrum Cybersecurity Vice President

Work from home Full-time role

Online Airport Customer Service Representative – Digital Passenger Support & Travel Assistance at careerzynith

Work from home Full-time role

Global Market Development Manager

Work from home Full-time role

Bid Manager

Work from home Full-time role

[Remote] Actuarial Analyst II | Workers' Compensation

Work from home Full-time role

Senior Strategic Agency Manager, careerzynith Customer Solutions

Work from home Full-time role

Remote Home‑Based Data Entry Specialist – Flexible Survey Participation & Consumer Insight Contributor

Work from home Full-time role

Account Development Manager

Work from home Full-time role

Senior Data Analyst – Remote Part‑Time – Advanced Analytics, Business Intelligence & Insight Generation at careerzynith

Work from home Full-time role

Procurement Specialist - Los Angeles, CA

Work from home Full-time role

PTCB / ExCPT Certified Pharmacy Technician – Remote (FL, IN, IL, WI)

Work from home Full-time role

Full-Time Virtual Customer Service & Appointment Setting Specialist – Spa & Salon Industry – Phone & Chat (Contract‑to‑Hire) – careerzynith

Work from home Full-time role