Remotery

Research Lead – Principal Scientist, Manager – Alignment, Reinforcement Learning

Posted Jun 3

This is a fully remote position, open to applicants in Germany.

📋 Description

• Take charge of the post-training strategy for model development.

• Create innovative algorithms that enhance model reliability, controllability, and alignment.

• Design and conduct experiments that influence model behavior, robustness, and reasoning quality.

• Oversee, mentor, and develop a team of AI scientists.


⛳️ Requirements

• In-depth, hands-on experience in reinforcement learning for foundation models.

• Proficiency in post-training techniques (RLHF, RLAIF, DPO, PPO, or similar methods).

• Demonstrated experience in leading or mentoring technical research teams.

• Strong understanding of model behavior, alignment issues, and post-training trade-offs.

• Experience in designing evaluation systems.

• Ability to communicate complex technical trade-offs clearly.


🏝️ Benefits

• Comprehensive benefits package.

