[Remote] Software Engineer

Work from home | Full-time role | Hiring

Note: This is a remote role open to candidates in the USA. Gold Group Ltd is a leading AI research institute seeking a Software Engineer to join its Benchmarking team. The role involves developing evaluations of AI models and collaborating with researchers whose work influences the AI community.

Responsibilities

  • Develop and run evaluations of the latest AI models
  • Build new benchmarks
  • Maintain evaluation infrastructure
  • Collaborate directly with researchers producing work that influences policymakers, industry leaders, and the wider AI community

Skills

  • Strong software engineering experience (language agnostic – Python preferred)
  • An interest in LLM evaluations, benchmarking, or AI capability testing
  • Curiosity about frontier AI and a research-oriented mindset
  • Enjoyment of experimentation, solving difficult technical problems, and improving evaluation frameworks
  • Experience with evaluation frameworks such as Inspect

Benefits

  • Fully remote
  • Three international company retreats each year
  • Flexible working hours

Company Overview

  • Gold Group is celebrating 25 years in recruitment as one of the UK’s leading independently owned technical and professional recruitment consultancies. It was founded in 2000 and is headquartered in East Grinstead, West Sussex, UK, with a workforce of 11-50 employees. Its website is https://www.goldgroup.co.uk/.