[Remote] Senior Backend Engineer - Provider Directory
Note: The job is a remote job and is open to candidates in USA. b.well Connected Health is addressing the fragmentation problem in healthcare through their FHIR-based health data management platform. The Senior Backend Engineer will be responsible for the ingestion, data quality, and operational layer of the provider directory, ensuring reliable data processing and building AI-powered tooling to enhance data quality.
Responsibilities
- Design and build scalable data ingestion pipelines that onboard new provider data sources — EHR brand files, national registries, partner feeds — and transform them into standardized FHIR resources (Practitioner, Organization, PractitionerRole, Endpoint, Location)
- Own data quality end-to-end: define validation rules, confidence scores, and quality thresholds; build automated monitoring and alerting; identify and resolve issues before they reach users
- Build AI-powered data tooling — entity resolution across duplicate provider records, practitioner-to-organization relationship inference, specialty classification from unstructured data, and quality scoring that quantifies how much you trust a record
- Operate and evolve existing Spark-based pipelines orchestrated by Prefect on AWS, improving reliability, observability, and onboarding speed so new data sources go from raw files to searchable records in days, not weeks
- Establish data governance standards for the provider directory: schemas, staging processes, and refinement workflows that turn messy inputs into trustworthy outputs
- Partner with Analytics to build data quality dashboards and reporting. Partner with Product and Business teams to prioritize which data sources and quality improvements have the highest user impact
- Lead incident response when data issues arise — stale records, broken pipelines, source regressions — and build the observability to catch problems before users do
Skills
- 5+ years of experience in building and operating data-intensive systems at scale
- Deep Python proficiency
- Comfortable with Spark or Databricks for distributed data processing
- Experience with workflow orchestration tools like Prefect or Airflow
- Sound data and storage instincts
- Experience with both relational and document databases (we run MongoDB and OpenSearch)
- Use of AI tools like Claude as a natural part of your development workflow
- Cloud-native fundamentals: AWS (S3, ECS/EKS, Lambda), Docker, Kubernetes, CI/CD with GitHub Actions
- Strong ownership instincts
- Clear communication skills
- Healthcare or FHIR experience is a strong plus and something you'll go deep on here
Benefits
- Stock options
- Benefits
- Incentive pay for eligible roles
Company Overview