← all jobs

[Remote] Research Scientist, Data

Work from home Full-time role Hiring

Note: The job is a remote job and is open to candidates in USA. Pika is pioneering the next generation of creative infrastructure built around real-time, multimodal generation and intelligent agentic platforms. They are looking for a staff or lead-level Research Engineer, Data to architect and scale data engineering systems supporting model training for advanced multimodal foundation models.

Responsibilities

  • Take ownership of large-scale data pipeline architecture and implementation to support model training and research workflows for text, image, audio, and video datasets
  • Partner with research and engineering teams to curate, clean, and manage diverse, sensory-rich datasets for pre-training and mid-training of multimodal models
  • Develop strategies and tools for scalable data ingestion, labeling, filtering, augmentation, and storage
  • Ensure data quality, reliability, and compliance, including managing privacy and ethical considerations throughout the data lifecycle
  • Optimize data processing, transformation, and delivery for large-scale distributed training pipelines
  • Prototype and productionize new methods for dataset creation, management, and continuous improvement in response to researcher needs
  • Contribute to the integration of research-driven data advancements into production-ready systems
  • Stay informed on emerging data engineering and ML data management developments, bringing best practices to our systems

Skills

  • 5+ years of experience building and scaling data pipelines for machine learning applications at staff or lead engineer level, ideally in research or model training environments
  • Strong background in data engineering and ML data curation for LLMs, VLMs, or other large-scale multimodal models
  • Expertise in distributed data systems (e.g., Spark, Hadoop, Ray, or similar) and efficient large dataset processing/ETL workflows
  • Proven ability to build robust, scalable, and production-grade data infrastructure for ML pipelines
  • Experience developing tools for data labeling, filtering, deduplication, quality assurance, and dataset management
  • Strong programming skills (Python, SQL, PySpark, or similar) and familiarity with cloud data platforms (AWS, GCP, Azure)
  • Knowledge of privacy, compliance, ethics, and best practices in data collection and management
  • Excellent cross-functional collaboration, problem-solving, and communication skills
  • Passion for enabling cutting-edge generative AI and creative technology through data excellence

Benefits

  • Competitive salary and substantial equity in a high-growth startup
  • Full health benefits, 401k matching, and more
  • Collaborative, mission-driven team environment with major growth opportunities
  • Flexible on-site/remote hybrid (HQ in Palo Alto, CA)

Company Overview

  • Pika is an AI platform that allows users to create videos from text prompts, including text to video, image to video, and editing tools. It was founded in 2023, and is headquartered in Palo Alto, California, USA, with a workforce of 2-10 employees. Its website is https://pika.art.
  • Company H1B Sponsorship

  • Pika has a track record of offering H1B sponsorships, with 9 in 2025. Please note that this does not guarantee sponsorship for this specific role.
  • More open positions

    [Remote] Senior Director, Corporate Systems- Finance Analytics & Reporting

    Work from home Full-time role

    [Remote] Strategic Sales Director

    Work from home Full-time role

    [Remote] Cereals Product Manager

    Work from home Full-time role

    [Remote] Manager, Business Systems & Analytics

    Work from home Full-time role

    [Remote] Account Executive, Social & Influencer

    Work from home Full-time role

    Mechanical Engineer II

    Work from home Full-time role

    Talent Acquisition Director

    Work from home Full-time role

    Bilingual sales representative (experience in sales SAS)

    Work from home Full-time role

    [Remote] Principal Customer Success Manager - Remote

    Work from home Full-time role

    Product Manager – Animal Health / Pharmaceutical Manufacturing

    Work from home Full-time role

    Immediate Hiring: careerzynith Data Entry Remote Job $27/Hour - Experienced Full Stack Data Entry Specialist

    Work from home Full-time role

    Vendor Management Coordinator

    Work from home Full-time role

    Outbound Inside Sales Account Executive

    Work from home Full-time role

    Join Today: Part Time Remote Data Entry Job (Walmart Part Time) –

    Work from home Full-time role

    Executive Admin / Chief of Staff

    Work from home Full-time role

    Senior IRB Specialist, Campus Team, (Remote)

    Work from home Full-time role

    [Remote] Technical Writer

    Work from home Full-time role

    Remote Weekend & Evening Registered Dietitian / Certified Nutrition Specialist

    Work from home Full-time role

    Entry Level Chat Assistant - Remote job for reputed company (Part Time)

    Work from home Full-time role

    [Remote] Safety Engineer I and II - Civil

    Work from home Full-time role

    Search and NLP Engineer - Virtual

    Work from home Full-time role