
Senior Data Scientist
Posted Jun 20

Posted Jun 20
This is a fully remote position, open to applicants in United States.
β’ Concentrate on comprehending current models, evaluating their performance, choosing the most suitable architectures, and refining them to fulfill specific domain and business requirements, including RAG-based applications.
β’ Work in close collaboration with data engineering, product, and domain teams to transform real-world research challenges into scalable, model-driven solutions.
β’ Enhance and fine-tune LLMs and domain-specific variants utilizing proprietary datasets to meet precision and recall objectives.
β’ Assess model performance against key metrics and benchmarks, recognizing strengths, weaknesses, and areas for improvement.
β’ Implement and operationalize LLM-based and RAG systems that improve Braid-powered products.
β’ Partner with data engineering to guarantee scalable and efficient model training, evaluation, and deployment pipelines.
β’ Evaluate and choose models that are best suited to domain-specific needs.
β’ Collaborate with clinical and operational specialists to convert research and trial challenges into quantifiable model evaluation frameworks and optimization strategies.
β’ Perform model interpretability and bias analyses to ensure fairness, transparency, and adherence to governance standards.
β’ Document methodologies and validation outcomes to support internal governance, reproducibility, and audit preparedness.
β’ Contribute to reusable fine-tuning workflows, evaluation frameworks, and model monitoring pipelines.
β’ Remain updated on advancements in LLM optimization, retrieval augmentation, and multi-modal learning.
β’ MS in Machine Learning, Computer Science, or a related quantitative field, or equivalent relevant work experience.
β’ Over 5 years of practical experience in developing and fine-tuning ML or LLM models.
β’ Proven expertise in Python, along with experience and knowledge of a commercial framework such as PyTorch.
β’ Practical experience in developing, managing, and troubleshooting workflows within Databricks for data engineering, analytics, and machine learning projects.
β’ Documented strong understanding of the ML lifecycle.
β’ Familiarity with embeddings and retrieval-augmented generation (RAG).
β’ Health coverage.
β’ Paid holidays.
β’ Variable bonus.
Zeta Global
Humana
Binance.US
10x Genomics
Get handpicked remote jobs straight to your inbox weekly.