
AI Research Engineer – Pre-training
Posted May 19

This is a fully remote position, open to applicants in the Netherlands.
• Run pre-training of AI models on large distributed clusters with thousands of NVIDIA GPUs.
• Develop, prototype, and scale innovative architectures to improve model intelligence.
• Carry out experiments independently and in collaboration with others, analyze outcomes, and refine methodologies to achieve optimal performance.
• Explore, troubleshoot, and enhance both model efficiency and computational performance.
• Play a key role in advancing training systems to guarantee seamless scalability and efficiency across target platforms.
• A degree in Computer Science or a related discipline.
• Preferably a PhD in NLP, Machine Learning, or a related area, supported by a strong track record in AI R&D (with notable publications in A* conferences).
• Practical experience contributing to large-scale LLM training on large distributed clusters with thousands of NVIDIA GPUs, with a focus on scalability and significant improvements in model performance.
• Familiarity and hands-on experience with large-scale, distributed training frameworks, libraries, and tools.
• In-depth understanding of state-of-the-art modifications to transformer and non-transformer architectures aimed at improving intelligence, efficiency, and scalability.
• Strong proficiency in PyTorch and Hugging Face libraries, with practical experience in model development, continual pretraining, and deployment.
• Flexible work arrangements
• Professional development opportunities
Tether.to