
Principal Data Scientist, Health Informatics
Posted 1 day ago

This is a fully remote position, open to applicants in California and 12 other states.
• Take ownership of clinical data quality across claims, EHR, and ADT: Establish standards for structuring, normalizing, and validating clinical data as modeling inputs from payer claims (medical, pharmacy, eligibility), EHR data (Epic, Cerner, Athena), and real-time ADT feeds. Apply extensive knowledge of EHR data formats (FHIR, HL7, C-CDA) and of how data from systems such as Epic, Cerner, and Athena relates to clinical realities. Uphold high standards for clinical accuracy and completeness across all three data sources.
• Develop and deploy production ML/AI models: Create, validate, and implement models for risk stratification, care gap prediction, treatment effect estimation, and applications of LLM/foundation models — ensuring rigor in areas such as leakage, calibration, fairness, and clinical face validity.
• Utilize health economics and outcomes methods: Convert raw clinical and claims data into decision-quality evidence through techniques like risk adjustment, utilization measurement, cost attribution, quasi-experimental evaluation, and outcomes measurement in accordance with CMS, NCQA, and MCO reporting standards.
• Enhance machine learning and AI products: Provide senior modeling expertise for the product roadmap, ensuring the clinical and methodological integrity of all deliverables.
• Establish standards and mentor others: Make architectural decisions, foster alignment among data science, engineering, product, and clinical stakeholders, and guide junior data scientists to elevate the team's technical capabilities.
• Expertise in Healthcare Data: Extensive hands-on experience with claims, EHR, and ADT data, along with a strong understanding of clinical terminologies (ICD-10, SNOMED CT, LOINC, RxNorm, CPT/HCPCS) and value set curation.
• Proficiency in Standards: Practical experience with healthcare data standards and exchange formats — FHIR, HL7v2, and C-CDA.
• Educational Background: A Master’s degree in Data Science, Biostatistics, Health Informatics, Computer Science, or a related discipline.
• Python Expertise: 7-8+ years of hands-on experience in Python, including proficiency in data science and ML libraries.
• Experience in Applied ML/AI: Proven track record of building, validating, and deploying production ML models on healthcare data, overseeing the entire process from development to deployment and maintenance in a live setting. Familiarity with ML pipelines, model versioning, and reproducible workflows at scale.
• Project Management Skills: Demonstrated capability to independently manage complex technical projects, align various stakeholders, and meet deadlines.
• Stock Options: Opportunity to invest in the company’s growth.
• Work-from-Home Stipend: A dedicated stipend for your initial year to assist in setting up your home office.
• Medical, Vision, and Dental Coverage: Comprehensive plans to ensure the health of you and your family.
• Life Insurance: Basic life insurance for your peace of mind.
• Paid Time Off: 20 vacation days accrued throughout the year, in addition to 11 paid holidays.
• Parental Leave: 16 weeks of paid leave for birthing parents after six months of employment, and 8 weeks of bonding leave for non-birthing parents.
• Retirement Savings: Access to a 401(k) plan with company contributions, subject to a vesting schedule.
• Commuter Benefits: Convenient options to support your commuting needs.
• Professional Development Stipend: A dedicated stipend to support your professional growth and development.