Remotery

Member of Engineering – Reinforcement Learning Infrastructure

Posted Jun 3

This is a fully remote position, open to applicants in Europe.

📋 Description

• Stay updated with the latest research and have a solid understanding of the current advancements in LLMs, RL, and code generation.

• Create techniques for optimizing training and inference processes to achieve high throughput.

• Design data control systems within an RL pipeline that determine what the model observes and when.

• Identify and troubleshoot instances where infrastructure choices are adversely affecting learning dynamics.

• Develop observability tools that highlight when a system-level issue is the underlying cause of a training regression.

• Contribute to the construction of robust, adaptable, and scalable RL pipelines.

• Enhance performance across the entire stack, including networking, memory, compute scheduling, and I/O.

• Produce high-quality, practical code.

• Collaborate with the team: plan future actions, engage in discussions, and maintain constant communication.


⛳️ Requirements

• Proven experience with LLMs and workflows following model training.

• Knowledge of Reinforcement Learning principles and awareness of its primary challenges.

• Strong foundation in software engineering (testing, code reviews, debugging complex systems).

• Proficient in Python, with expertise in concurrency, asynchronous programming, multiprocessing, and performance enhancement.

• Familiarity with deep learning frameworks (such as PyTorch or JAX) and RL workflows (rollouts, replay buffers, policy updates).

• Experience in designing and maintaining distributed RL training systems.

• Background in large-scale LLM training infrastructure.

• Proficient with profiling tools across the stack (e.g., py-spy).

• Familiarity with inference stacks (e.g., vLLM).

• Preferred: Contributions to open-source RL or distributed ML projects.


🏝️ Benefits

• Fully remote work with flexible hours.

• 37 days of vacation and holidays each year.

• Health insurance allowance for you and your dependents.

• Equipment provided by the company.

• Wellbeing, continuous learning, and home office allowances.

• Regular team gatherings.

• A diverse and inclusive people-first culture.

People also viewed

Spread Tecnologia33 min ago

PL/SQL Developer, PL

BR flagBrazil OnlyFull-timeSoftware Engineer
ApplyView job
Adistec51 min ago

Engineering Sales Specialist

EC flagEcuador OnlyFull-timeSoftware Engineer
ApplyView job
Strix PL51 min ago

Senior Symfony Developer

PL flagPoland OnlyFull-timeSoftware Engineer
ApplyView job
Tether.to13 hours ago

Bare Developer

DK flagDenmark OnlyFull-timeSoftware Engineer
ApplyView job
SD Solutions13 hours ago

Mechanical Designer – Ventilation & Engineering

UA flagUkraine OnlyFull-timeSoftware Engineer
ApplyView job
SIS International Research & Strategy Consulting13 hours ago

Survey Programmer – Ops, Scripting

IN flagIndia OnlyFull-timeSoftware Engineer₹600k/year
ApplyView job

Never miss a great job!

Get handpicked remote jobs straight to your inbox weekly.

Trusted by 7,400+ designers