[Remote] AI Research Engineer (Multi-Modal Reinforcement Learning) - 100% Remote Worldwide

Work from home Full-time role Hiring

Note: The job is a remote job and is open to candidates in USA. Tether.io is pioneering a global financial revolution through their innovative digital finance solutions. They are seeking an AI Research Engineer to drive innovation in multi-modal reinforcement learning, focusing on optimizing decision-making and adaptive behavior across various data modalities to enhance AI performance in real-world challenges.

Responsibilities

Conduct research on reinforcement learning algorithms for multimodal models, including diffusion-based approaches for image autoregressive models for multimodal understanding, and unified frameworks that integrate multiple modalities
Design and build reinforcement learning infrastructure that supports scalable, distributed training across multimodal systems while maintaining efficiency and reliability
Develop and refine reward modeling strategies that improve training stability, align model behavior with desired outcomes, and mitigate reward hacking and related failure modes
Create and curate multimodal simulation environments and datasets to support robust training, evaluation, and benchmarking of reinforcement learning systems
Design and conduct rigorous benchmarking and evaluation protocols to measure model performance, track progress against baselines, and validate improvements across multimodal tasks
Analyze and optimize policy performance across modalities by identifying bottlenecks in training, credit assignment, and cross-modal alignment
Investigate and develop next-generation reinforcement learning paradigms that more effectively learn from environment feedback, with the goal of achieving superior state-of-the-art (SOTA) performance
Publish research findings in top-tier conferences such as ICML, NeurIPS, ICLR, CVPR, ICCV, ECCV etc

Skills

A Master's degree in Computer Science or a related field is required
Proven experience running large-scale reinforcement learning experiments in multimodal and vision-centric systems, including online RL settings, with demonstrated impact on domain-specific decision-making and measurable improvements in policy performance
Deep understanding of reinforcement learning algorithms and optimization methods applied to vision and multimodal learning problems, with a focus on improving policy stability, exploration, and sample efficiency in complex, high-dimensional environments involving images, video, and other modalities
Strong proficiency in PyTorch and deep learning frameworks for vision and multimodal AI, with hands-on experience building end-to-end RL pipelines covering simulation, training, evaluation, and deployment in production-grade systems
Demonstrated ability to apply empirical research to solve core RL challenges in multimodal and vision tasks, such as sample inefficiency, exploration-exploitation tradeoffs, and training instability, along with experience designing robust evaluation frameworks and iterating on algorithmic improvements to advance agent performance
Proven track record of research publications in top-tier conferences such as ICML, NeurIPS, ICLR, CVPR, ICCV, ECCV etc
A PhD in Machine Learning, NLP, Computer Vision, or a closely related discipline is preferred, along with a strong track record of AI research and publications in top-tier conferences

Company Overview

Tether has evolved to meet global needs with agility and vision. It was founded in 2014, and is headquartered in Seattle, Washington, USA, with a workforce of 201-500 employees. Its website is https://tether.io.

Apply Now

[Remote] AI Research Engineer (Multi-Modal Reinforcement Learning) - 100% Remote Worldwide

More open positions

[Remote] Mid-Level Process Engineer (Hybrid/Remote)

[Remote] Head of Marketing

[Remote] Senior Product Manager, Provider

[Remote] Senior Product Manager, Trust & Safety, Integrity and Fraud

[Remote] VP, Finance Excellence and Transformation

MLOps Engineer - Remote (AWS Certified Machine Learning)

Healthcare Scheduler Hybrid

Account Executive - Boston

Technical Support Engineer (L1/Frontline Support), EU

Project Delivery Coordinator

[Remote] US Program Director, Emergency Cash

Technical Applications Specialist III (Clinical Integrated Solutions)

Remote Life Insurance Agents Needed - Get Licensed with Our Support | WFH

[Remote] Quality Assurance Engineer

Psychologist (Clinical Resource Hub)

Experienced Customer Service/Inside Sales Representative (Healthcare) – Remote Opportunity at careerzynith

Trust & Safety, Senior Manager

Bharani- Prompt Creation Specialist- Thai (Thailand)

Experienced Full Stack Data Entry Specialist – Remote Data Management and Support

Director, Digital Transformation - Site Management Operations

Remote Social Media Customer Support Representative – Engaging Audiences for careerzynith’s Global Entertainment Brand