Remotery

Research Scientist, LLM Evaluation – Post-Training

Posted 4 hours ago

This is a fully remote position, open to applicants in California, +1 more state.

📋 Description

• Establish and implement a comprehensive research agenda centered on LLM evaluation and post-training, with a focus on model enhancement driven by evaluation.

• Create experiments to investigate how different evaluation methodologies influence fine-tuning and post-training results.

• Develop and authenticate extensive evaluation frameworks for both LLM and multimodal systems.

• Spearhead research in cutting-edge evaluation areas, including long-context, cross-modal, and dynamic multi-turn evaluations.

• Examine model behaviors and identify failure patterns; provide actionable insights for model enhancement.

• Collaborate with Language Data Scientists to incorporate human-in-the-loop and synthetic data/evaluation methodologies.


⛳️ Requirements

• A Master's or PhD in Computer Science, Machine Learning, Statistics, Applied Mathematics, AI, or a related quantitative discipline (PhD is highly preferred).

• Over 5 years of pertinent experience in applied ML research or scientific research, with significant work in LLMs or foundational models (graduate research is applicable).

• Proven experience with LLM evaluation, benchmarking, alignment, post-training processes, or model quality research.

• A solid understanding of experimental design, statistical analysis, and scientific reasoning applicable to ML systems.

• Proficient in Python programming for research experimentation, data processing, evaluation pipelines, statistical analysis, and visualization.

• Practical experience with contemporary ML frameworks (PyTorch, Hugging Face, JAX/TensorFlow).


🏝️ Benefits

• Options for remote work.

• Opportunities for professional development.

People also viewed

Jade Biosciences4 hours ago

Principal Scientist, Immunology

US flagCalifornia, +1 more stateFull-timeResearch Scientist$175k – $190k/year
ApplyView job
Sophos4 hours ago

Senior Threat Researcher

GB flagUnited Kingdom OnlyFull-timeResearch Scientist
ApplyView job
SandboxAQ4 hours ago

Research Scientist, Battery Materials Simulation

US flagUnited States OnlyFull-timeResearch Scientist$112k – $210k/year
ApplyView job
SandboxAQ5 hours ago

Senior Research Scientist, Battery Materials Simulation

US flagUnited States OnlyFull-timeResearch Scientist$134.4k – $252k/year
ApplyView job
Kerr Dental19 hours ago

Principal Scientist, Translational Medicine, Preclinical Safety

US flagCalifornia OnlyFull-timeResearch Scientist$119.7k – $222.3k/year
ApplyView job
Syneos Health1 day ago

Principal Medical Scientist – Project Lead

PL flagPoland OnlyFull-timeResearch Scientist
ApplyView job

Never miss a great job!

Get handpicked remote jobs straight to your inbox weekly.

Trusted by 7,400+ designers