
AI Research Engineer – Pre-training
Posted May 19

This is a fully remote position, open to applicants in Sweden.
• Propel innovation in the development of architecture for AI models.
• Carry out pre-training of AI models on extensive, distributed servers that utilize thousands of NVIDIA GPUs.
• Create, prototype, and scale groundbreaking architectures to improve model intelligence.
• Execute experiments both independently and collaboratively, analyze the outcomes, and enhance methodologies for peak performance.
• Explore, troubleshoot, and enhance both model efficiency and computational performance.
• Aid in the evolution of training systems to guarantee seamless scalability and efficiency across target platforms.
• A degree in Computer Science or a related discipline.
• Preferably a PhD in NLP, Machine Learning, or a comparable field, supported by a robust history in AI R&D (including strong publications in A* conferences).
• Practical experience in contributing to large-scale LLM training operations on extensive, distributed servers with thousands of NVIDIA GPUs.
• Familiarity and hands-on experience with large-scale distributed training frameworks.
• Deep understanding of state-of-the-art transformer and non-transformer architecture modifications designed to enhance intelligence, efficiency, and scalability.
• Significant expertise in PyTorch and Hugging Face libraries with practical experience in model development, continual pretraining, and deployment.
• 100% Remote work.
• Opportunity to collaborate with a global team.
• Access to cutting-edge technology.
• Opportunities for professional development.
Tether.to