
AI Research Engineer – Pre training
Posted May 20

Posted May 20
This is a fully remote position, open to applicants in Italy.
• Execute pre-training of AI models on extensive, distributed servers that utilize thousands of NVIDIA GPUs.
• Create, prototype, and scale cutting-edge architectures to improve model intelligence.
• Conduct experiments independently and in collaboration with others, analyze outcomes, and enhance methodologies for peak performance.
• Explore, troubleshoot, and enhance both model efficiency and computational performance.
• Play a role in the evolution of training systems to guarantee seamless scalability and efficiency on designated platforms.
• A degree in Computer Science or a related discipline.
• Preferably a PhD in NLP, Machine Learning, or a similar field, supported by a strong history in AI research and development (with notable publications in A* conferences).
• Practical experience in contributing to large-scale LLM training processes on extensive, distributed servers utilizing thousands of NVIDIA GPUs.
• Familiarity with large-scale, distributed training frameworks, libraries, and tools.
• In-depth understanding of state-of-the-art transformer and non-transformer modifications aimed at boosting intelligence, efficiency, and scalability.
• Strong proficiency in PyTorch and Hugging Face libraries, with hands-on experience in model development, continuous pretraining, and deployment.
• Flexible working arrangements.
• Opportunities for professional development.
Tether.to
Insight Timer
Tether.to
Get handpicked remote jobs straight to your inbox weekly.