[Remote] AI Agent Engineer
Note: The job is a remote job and is open to candidates in USA. D24 Search is a consumer AI startup building a platform for interactive mini-apps. They are seeking an AI Agent Engineer to lead the development of the core engine that transforms natural language into interactive experiences, impacting over 1 million monthly users.
Responsibilities
- Design and own the agent runtime and orchestration layer for our coding agent
- Build long-horizon agent workflows: prompt → plan → generate → run/validate → repair → publish
- Develop robust evaluation and quality loops including eval harnesses, regression testing, and failure taxonomy
- Design model strategies including routing, benchmarking, reliability improvements, and cost/latency optimization
- Create debuggable agent systems with tracing, metrics, alerts, and observability
Skills
- 1 - 8 years of experience in production AI agentic systems or AI/ML engineering
- Experience building agentic systems OR AI/ML engineering in a production environment
- Shipped in large, shared codebases at scale
- Bachelor's degree in Computer Science or related field
- Strong proficiency in Python
- Experience building agent evaluation frameworks including eval harnesses, A/B testing agent changes, and statistically grounded model measurement
- Familiarity with modern agent frameworks, specifically LangGraph and Pydantic AI
- Experience in async programming and distributed systems – Kafka, Spark, Flink, or equivalent
- Communicates impact in business terms — resume should reflect measurable outcomes on users/product, not just technical metrics
- Must be able to work remotely in PST, with a Sunday evening standup
- AI-native — actively follows latest agent framework releases, open source updates, and eval tooling
- Experience at a VC-backed AI startup — coding agent or developer tool company strongly preferred
- Experience building a coding agent, devtool, or IDE assistant
Company Overview