
Staff Software Engineer, Data Extraction
Posted 9 hours ago

This is a fully remote position, open to applicants in New York.
• Take ownership of the architecture for clinical data extraction. Design and enhance systems that gather, process, and standardize both structured and unstructured clinical data from various healthcare sources.
• Develop document intelligence pipelines. Create services that extract valuable information from PDFs, scanned documents, images, and clinical records.
• Lead initiatives in healthcare data modeling. Formulate strategies for converting raw clinical artifacts into structured, queryable datasets.
• Create scalable ingestion services. Design robust systems capable of processing substantial volumes of healthcare records with high accuracy and reliability.
• Construct scalable data extraction systems. Utilize the best available technologies and architectural methods to enhance information extraction, classification, and normalization across a variety of healthcare data sources.
• Set data quality standards. Develop validation frameworks, monitoring systems, and feedback mechanisms that enhance extraction accuracy over time.
• Collaborate and influence across clinical, product, engineering, data, and operational domains. Align on business objectives and translate them into technical systems that facilitate provider evidence generation.
• Spearhead technical strategy for unstructured healthcare data. Assess technologies, standards, and architectural strategies that expedite the adoption of clinical data.
• Mentor engineers throughout the organization. Elevate engineering standards through design reviews, technical leadership, and collaborative efforts.
• 8+ years of experience in building backend software systems within production settings.
• Proficiency with healthcare data standards such as HL7, FHIR, CDA, or EHR integrations.
• Understanding of medical records, claims, or healthcare operations.
• Experience in designing large-scale data processing or document processing platforms, particularly those involving clinical data, along with familiarity with medical terminology and coding practices.
• Strong expertise in Python and SQL, or comparable backend programming languages.
• Experience with unstructured data and NLP systems.
• Proven track record of designing systems that function reliably with imperfect or incomplete datasets.
• Experience in making architectural decisions that impact multiple teams or business functions.
• Comfortable navigating ambiguous situations where requirements adapt as the business evolves.
• Demonstrated commitment to ownership and achieving results rather than merely completing tasks.
• Competitive compensation, including equity.
• Comprehensive health, dental, and vision insurance.
• Retirement savings plan through 401(k).
• Flexible time off.
• Opportunities for company-wide connection and events.