[Remote] Software Engineer

Work from home Full-time role Hiring

Note: The job is a remote job and is open to candidates in USA. Gold Group Ltd is a leading AI research institute seeking a Software Engineer to join their Benchmarking team. The role involves developing evaluations of AI models and collaborating with researchers to influence the AI community.

Responsibilities

Develop and run evaluations of the latest AI models
Build new benchmarks
Maintain evaluation infrastructure
Collaborate directly with researchers producing work that influences policymakers, industry leaders, and the wider AI community

Skills

Strong software engineering experience (language agnostic – Python preferred)
An interest in LLM evaluations, benchmarking, or AI capability testing
Curiosity about frontier AI and a research-oriented mindset
Someone who enjoys experimentation, solving difficult technical problems, and improving evaluation frameworks
Experience with evaluation frameworks such as Inspect

Benefits

Fully remote
Three international company retreats each year
Flexible working hours

Company Overview

Gold Group is celebrating 25 years in Recruitment! As one of the UK’s leading independently owned technical and professional recruitment consultancies. It was founded in 2000, and is headquartered in East Grinstead, West Sussex, GBR, with a workforce of 11-50 employees. Its website is https://www.goldgroup.co.uk/.

Apply Now

[Remote] Software Engineer

More open positions

[Remote] 100% Remote - Sr. Clinical Advisor

[Remote] Lead OCM Consultant -Organizational Change Management

[Remote] Director, Product Management - Brokerage

[Remote] Director of Mechanical Engineering (Building Systems)

[Remote] Director, Clinical Strategy & Operations

Assoc Clinical Specialist -1

[Remote] Application Security AI Engineer

Fractional CFO | Non-Profit

Senior AI & Cloud Engineer

Director, Commercial, Switzerland

MariaDB SME/Technical Architect

Senior Manager, Supply Chain

Telehealth Nurse Practitioner | Upto $75/hr Remote

[Remote] Senior QA Automation Engineer

[Hiring] Nurse Auditor Revenue Integrity @Trinity Health

[Remote] Senior Software Engineer II AI-Native, Mobile, Developer Experience

Full-charge Bookkeeper

Mobile Lending Specialist

[Remote] Senior Site Reliability Engineer

Remote Cognitive Behavioural Psychotherapist - Young Persons

Hotel Contractor – Luxury Travel