
AI Data Engineer
Posted 2 days ago

This is a fully remote position, open to applicants in New Hampshire, +4 more states.
• Collaborate with data scientists and machine learning engineers to understand the data requirements for fine-tuning LLMs and other machine learning models.
• Design, construct, and maintain scalable data pipelines to ingest, process, and store extensive and varied healthcare datasets.
• Implement strong data validation and monitoring to ensure the integrity, accuracy, and consistency of all training datasets.
• Establish robust data cleaning, validation, and transformation procedures to guarantee data quality and integrity.
• Develop and optimize data structures and schemas for efficient access and utilization by LLMs and machine learning models.
• Collaborate with the team to identify and acquire new data sources, ensuring compliance with pertinent healthcare regulations (e.g., HIPAA).
• Monitor the performance of data pipelines, troubleshoot issues, and implement optimizations to enhance efficiency and reliability.
• Document data engineering processes, data models, and data dictionaries.
• Stay informed about the latest advancements in data engineering, big data technologies, and machine learning.
Required
• Bachelor's degree in Computer Science, Engineering, or a related discipline.
• Demonstrated experience as a Data Engineer, with an emphasis on big data technologies.
• Strong expertise in programming languages such as Python, Scala, or Java.
• Extensive experience with data warehousing, ETL processes, and data modeling.
• Familiarity with major cloud service providers (e.g., AWS, GCP, Azure) and their data storage and processing services.
• Practical experience with big data frameworks like Apache Spark for distributed processing.
• Exceptional problem-solving abilities and the capability to work both independently and as part of a team.
• Strong communication and interpersonal skills.
Preferred
• Master’s degree in a related field.
• Experience with healthcare data and a solid understanding of healthcare data standards (e.g., FHIR, HL7).
• Familiarity with machine learning concepts and LLM fine-tuning processes.
• Experience with data orchestration tools (e.g., Apache Airflow).
Work Authorization:
• Must be a US Citizen, Green Card holder, or currently in the US with a valid H1B visa.
Why Join Us?
Joining C the Signs is not merely about developing AI; it’s about molding the future of healthcare. If you are a technical leader who firmly believes in the potential of AI to save lives and possesses the capability to implement it on a large scale, this is your chance to make a significant, global impact.
Benefits:
• Competitive salary and benefits package.
• Flexible working arrangements (remote or hybrid options available).
• Opportunity to work on transformative AI technology that directly affects patient outcomes.
• Join a team that blends cutting-edge innovation with a mission to save lives and enhance health equity.
• Continuous learning opportunities with access to the latest tools and advancements in AI and healthcare.