[Remote] Lead Data Science Engineer
Note: The job is a remote job and is open to candidates in USA. Health Care Service Corporation is a purpose-driven company that invests in the professional development of its employees. The Lead Data Science Engineer will be responsible for engineering work necessary for the creation, deployment, and management of AI capabilities, ensuring data quality and optimizing data pipelines.
Responsibilities
- Ensuring data quality
- Creation of new data pipelines
- Optimization and management of existing data pipelines
- Ingestion and curation of data sources for Gen AI purposes (including chunking/embedding strategies for RAG system)
- AI Agent delivery
- Prompt Engineering
- Selection and configuration of AI-specific tools and platforms
- Management and monitoring of AI models through MLOps tools and model ops practices
- Operationalizing AI capabilities working closely with a larger team
Skills
- Bachelor degree and 5 years of work experience in a computer science, engineering, or related field OR Master's degree and 4 years of work experience in a computer science, engineering, or related field OR Ph.D. and 2 years of work experience in a computer science, engineering, or related field
- Learning and growth mindset
- Customer-focused
- Interpersonal, verbal and written communication skills
- Must demonstrate proficiency in at least five and mastery in one of the following six areas: data analysis and relational-style query languages; data pipelining and ETL; working with semi structured and unstructured data; a high- level programming language; distributed computing; understanding of healthcare
- Proficiency in iterative development practices
- Independently delivering or leading the delivery of data engineering solutions for multiple complex analytics or data science projects and products
- A track record of independently delivering or leading the delivery of ML engineering capabilities
- Experience in Python-based Data Science frameworks (LangChain, LangGraph, LangFuse)
- Experience in Model evaluation and deployment
- Experience in data curation, prep, training, and fine-tuning of Models
- Experience in evaluation frameworks
- Experience in prompt engineering
- Experience in working with multiple Models
- Master degree in a computational field, or Bachelor degree with significant healthcare experience
- Understanding PySpark / Databricks to efficiently work with large data sets
- Azure Cloud Infrastructure / Deployment with emphasis on AI related tooling, Azure ML, Azure OpenAI, etc
- Experience in Observability Frameworks and Framework Operationalization
- Experience in creation of knowledge graph database (neo4J)
- Experience in working with Small Language Models or custom Models
Benefits
- Health and wellness benefits
- 401(k) savings plan
- Pension plan
- Paid time off
- Paid parental leave
- Disability insurance
- Supplemental life insurance
- Employee assistance program
- Paid holidays
- Tuition reimbursement
- Plus other incentives
- Annual incentive bonus plan subject to the terms and the conditions of the plan
Company Overview