[Remote] AI Data Engineer

Work from home Full-time role Hiring

Note: This is a remote position open to candidates in the USA. Bright Vision Technologies is a forward-thinking software development company dedicated to building innovative solutions that help businesses automate and optimize their operations. The company is seeking an AI Data Engineer to build and operate large-scale data systems that power modern AI training and evaluation pipelines.

Responsibilities

  • Design and operate large-scale data pipelines supporting AI training, evaluation, and continual improvement workflows
  • Build ingestion systems for diverse modalities including text, image, audio, video, and structured signals
  • Implement data cleaning, deduplication, filtering, and quality assurance at petabyte scale
  • Develop dataset versioning, lineage, and provenance tracking systems suitable for reproducible training
  • Build high-throughput data loading systems that maximize GPU utilization during training
  • Implement labeling workflows, active learning pipelines, and human-in-the-loop data improvement systems
  • Design storage architectures balancing cost, throughput, and latency across data tiers
  • Build evaluation dataset construction pipelines with strict integrity and contamination controls
  • Implement data privacy, redaction, and consent enforcement throughout the pipeline
  • Collaborate with ML researchers and engineers to align data systems with model development needs
  • Drive observability of data quality, drift, and pipeline health across the AI data estate
  • Optimize cost and performance through compression, format selection, and caching strategies
  • Document data systems, schemas, and operational procedures for broad internal use
  • Stay current with AI data infrastructure research and emerging open-source tools

Skills

  • Bachelor's or Master's degree in Computer Science or a related field
  • Six or more years of data engineering experience, with significant work supporting ML or AI workloads
  • Strong proficiency in Python and at least one JVM or systems language
  • Deep experience with modern data processing frameworks such as Spark, Ray, or Beam
  • Hands-on experience operating petabyte-scale storage and pipeline systems
  • Strong understanding of distributed systems, data modeling, and storage formats
  • Experience with dataset versioning, lineage, and reproducibility for ML workflows
  • Familiarity with high-throughput data loading for accelerator-based training
  • Strong software engineering practices including testing, CI/CD, and code review
  • Excellent communication and cross-functional collaboration skills
  • Experience with multimodal datasets at large scale
  • Familiarity with data quality tooling and dataset evaluation methodology
  • Exposure to privacy-preserving data systems and regulated data handling
  • Open-source contributions to data infrastructure projects
  • Experience supporting frontier model training pipelines

Benefits

  • Competitive base salary commensurate with experience, plus benefits.
  • 100% remote, full-time, direct W2 position with Bright Vision Technologies.
  • No new H1B sponsorship available. H1B transfers welcomed for qualified candidates.
  • Long-term, multi-year engagement aligned to the Bright Vision SOW delivery roadmap.
  • We do not engage in C2C, 1099, or third-party arrangements for this role.
  • We also make reasonable accommodations for applicants’ and employees’ religious practices and beliefs, as well as mental health or physical disability needs.

Company Overview

  • Bright Vision Technologies is an information technology company that offers software development, AI, and cybersecurity services. It was founded in 2020 and is headquartered in Bridgewater, New Jersey, USA, with a workforce of 51-200 employees. Its website is https://bvteck.com.