[Remote] Senior Data Scientist (Insurance Claims)
Note: The job is a remote job and is open to candidates in USA. CLARA Analytics is the leading AI as a service (AIaaS) provider that improves casualty claims outcomes for commercial insurance carriers and self-insured organizations. They are seeking a Senior Data Scientist to lead the development of their core Claim Clustering Platform, collaborating with various teams to ensure scalability, reliability, and continuous improvement.
Responsibilities
- Design and own the mathematical and technical framework for a scalable clustering engine that groups claims based on clinical, legal, and financial attributes
- Partner with Machine Learning Engineers to translate prototypes into optimized production systems, and work with MLOps to implement automated retraining, monitoring, and model lifecycle management
- Collaborate with Data and Application Engineering teams to define APIs and data contracts that power internal tools such as attorney and physician benchmarking, as well as fraud detection systems
- Develop advanced representations of claims using both structured data (e.g., ICD codes, geographic data, indemnity and legal costs) and unstructured data (e.g., adjuster notes, medical records, legal documents)
- Act as a subject matter expert within the broader engineering organization, ensuring alignment between data science initiatives, system architecture, and production reliability standards
Skills
- 4-7 years in Data Science with a strong track record of deploying models into production environments
- Deep experience with casualty claims, including at least 4 years of hands-on work in Workers' Compensation or General Liability. Familiarity with claim lifecycles, medical billing (ICD/CPT), litigation processes, and reserve dynamics is crucial
- Strong proficiency in Python and solid software engineering fundamentals, including system design, CI/CD pipelines, and API development/versioning
- Expertise in unsupervised learning techniques (clustering, dimensionality reduction) and NLP methods (transformers, embeddings, LLM-based approaches) for analyzing complex, unstructured data
- Proven ability to lead and execute projects across Data Science, Engineering, DevOps, and Product teams
Benefits
- Generously subsidized health insurance
- Employer-paid ancillary benefits
- Flex/unlimited PTO
- Fully remote
- 401k with company match
Company Overview
Company H1B Sponsorship