← all jobs

[Remote] AI Research Engineer (Multi-Modal Reinforcement Learning) - 100% Remote Worldwide

Work from home Full-time role Hiring

Note: The job is a remote job and is open to candidates in USA. Tether.io is pioneering a global financial revolution through their innovative digital finance solutions. They are seeking an AI Research Engineer to drive innovation in multi-modal reinforcement learning, focusing on optimizing decision-making and adaptive behavior across various data modalities to enhance AI performance in real-world challenges.

Responsibilities

  • Conduct research on reinforcement learning algorithms for multimodal models, including diffusion-based approaches for image autoregressive models for multimodal understanding, and unified frameworks that integrate multiple modalities
  • Design and build reinforcement learning infrastructure that supports scalable, distributed training across multimodal systems while maintaining efficiency and reliability
  • Develop and refine reward modeling strategies that improve training stability, align model behavior with desired outcomes, and mitigate reward hacking and related failure modes
  • Create and curate multimodal simulation environments and datasets to support robust training, evaluation, and benchmarking of reinforcement learning systems
  • Design and conduct rigorous benchmarking and evaluation protocols to measure model performance, track progress against baselines, and validate improvements across multimodal tasks
  • Analyze and optimize policy performance across modalities by identifying bottlenecks in training, credit assignment, and cross-modal alignment
  • Investigate and develop next-generation reinforcement learning paradigms that more effectively learn from environment feedback, with the goal of achieving superior state-of-the-art (SOTA) performance
  • Publish research findings in top-tier conferences such as ICML, NeurIPS, ICLR, CVPR, ICCV, ECCV etc

Skills

  • A Master's degree in Computer Science or a related field is required
  • Proven experience running large-scale reinforcement learning experiments in multimodal and vision-centric systems, including online RL settings, with demonstrated impact on domain-specific decision-making and measurable improvements in policy performance
  • Deep understanding of reinforcement learning algorithms and optimization methods applied to vision and multimodal learning problems, with a focus on improving policy stability, exploration, and sample efficiency in complex, high-dimensional environments involving images, video, and other modalities
  • Strong proficiency in PyTorch and deep learning frameworks for vision and multimodal AI, with hands-on experience building end-to-end RL pipelines covering simulation, training, evaluation, and deployment in production-grade systems
  • Demonstrated ability to apply empirical research to solve core RL challenges in multimodal and vision tasks, such as sample inefficiency, exploration-exploitation tradeoffs, and training instability, along with experience designing robust evaluation frameworks and iterating on algorithmic improvements to advance agent performance
  • Proven track record of research publications in top-tier conferences such as ICML, NeurIPS, ICLR, CVPR, ICCV, ECCV etc
  • A PhD in Machine Learning, NLP, Computer Vision, or a closely related discipline is preferred, along with a strong track record of AI research and publications in top-tier conferences

Company Overview

  • Tether has evolved to meet global needs with agility and vision. It was founded in 2014, and is headquartered in Seattle, Washington, USA, with a workforce of 201-500 employees. Its website is https://tether.io.
  • More open positions

    [Remote] Mid-Level Process Engineer (Hybrid/Remote)

    Work from home Full-time role

    [Remote] Head of Marketing

    Work from home Full-time role

    [Remote] Senior Product Manager, Provider

    Work from home Full-time role

    [Remote] Senior Product Manager, Trust & Safety, Integrity and Fraud

    Work from home Full-time role

    [Remote] VP, Finance Excellence and Transformation

    Work from home Full-time role

    MLOps Engineer - Remote (AWS Certified Machine Learning)

    Work from home Full-time role

    Healthcare Scheduler Hybrid

    Work from home Full-time role

    Account Executive - Boston

    Work from home Full-time role

    Technical Support Engineer (L1/Frontline Support), EU

    Work from home Full-time role

    Project Delivery Coordinator

    Work from home Full-time role

    [Remote] US Program Director, Emergency Cash

    Work from home Full-time role

    Technical Applications Specialist III (Clinical Integrated Solutions)

    Work from home Full-time role

    Remote Life Insurance Agents Needed - Get Licensed with Our Support | WFH

    Work from home Full-time role

    [Remote] Quality Assurance Engineer

    Work from home Full-time role

    Psychologist (Clinical Resource Hub)

    Work from home Full-time role

    Experienced Customer Service/Inside Sales Representative (Healthcare) – Remote Opportunity at careerzynith

    Work from home Full-time role

    Trust & Safety, Senior Manager

    Work from home Full-time role

    Bharani- Prompt Creation Specialist- Thai (Thailand)

    Work from home Full-time role

    Experienced Full Stack Data Entry Specialist – Remote Data Management and Support

    Work from home Full-time role

    Director, Digital Transformation - Site Management Operations

    Work from home Full-time role

    Remote Social Media Customer Support Representative – Engaging Audiences for careerzynith’s Global Entertainment Brand

    Work from home Full-time role