[Remote] Lead Data Scientist
Note: The job is a remote job and is open to candidates in USA. MDCalc is a leading medical reference tool used by over 65% of physicians worldwide, focused on improving patient outcomes through clinical decision tools. They are seeking a Lead Data Scientist to drive product innovation and enhance clinical outcomes by applying advanced statistical methods and predictive analytics in collaboration with cross-functional teams.
Responsibilities
- Define and lead analytical projects that influence product direction, business strategy, and clinical impact
- Develop predictive models and behavioral analyses that help identify which clinicians are using MDCalc, how they engage with clinical tools, and how usage patterns evolve across specialties, care settings, and patient conditions
- Analyze large and complex healthcare datasets, including clinician engagement, product usage patterns, and provider behavior, to uncover patterns, opportunities, and actionable insights
- Design and implement robust data pipelines, workflows, and dashboards that scale with MDCalc’s growth
- Partner with product managers, engineers, and clinical experts to translate data into product requirements and measurable outcomes
- Establish experimentation frameworks and lead A/B testing to optimize product performance
- Set standards for data integrity, reproducibility, and quality, ensuring confidence in insights across the organization
- Act as a thought partner for leadership, communicating findings and recommendations clearly to drive strategic decisions
Skills
- Bachelor's, Master's, or PhD in Data Science, Statistics, Computer Science, Applied Mathematics, or a related field
- 10+ years of experience in applied data science, statistical modeling, or predictive analytics, ideally working with healthcare, clinical, or provider behavior datasets
- Strong foundation in statistical methods, predictive modeling, and machine learning concepts
- Proficiency in Python or R and associated data science libraries such as pandas, scikit-learn, statsmodels, NumPy, or similar
- Strong SQL skills and experience working with large datasets
- Experience developing, evaluating, and improving machine learning models in real-world applications
- Experience designing and analyzing experiments and interpreting results with statistical rigor
- Ability to communicate complex technical concepts clearly and effectively
- Strong ownership mindset and ability to operate independently in a fast-moving environment
- Familiarity with data visualization and BI tools such as Tableau or Looker, with the ability to tell compelling data stories to diverse audiences
- Expertise in Python (pandas, NumPy, scikit-learn) and SQL, with experience building production-grade predictive or statistical models
- Deep knowledge of machine learning techniques such as regression, classification, clustering, recommendation systems, and causal inference
- Proven track record of leading data initiatives from concept to implementation and influencing product roadmaps
- Experience designing experiments, running A/B tests, and measuring product success through data
- Experience working with healthcare, clinical, or life sciences datasets such as provider behavior, claims data, utilization data, or clinical decision support platforms
Benefits
- Medical, Dental, & Vision coverage, with option to extend to your dependents
- Company-sponsored short-term insurance
- Fully-paid 8 week parental leave, after 6 months of employment
- Company-sponsored 401k, after 3 months of employment
- Unlimited vacation for salaried roles - we trust you to take the time you need
- Tri-annual company offsites to connect, reflect, and plan together
- Work from home monthly stipend
- Hybrid work environment with a great team office in Greenwich Village, NYC
- A culture of fun and motivated team members who believe in a greater mission here at MDCalc
Company Overview