This is a fully remote position, open to applicants in Sweden.

📋 Description

• Propel innovation in the development of architecture for AI models.

• Carry out pre-training of AI models on extensive, distributed servers that utilize thousands of NVIDIA GPUs.

• Create, prototype, and scale groundbreaking architectures to improve model intelligence.

• Execute experiments both independently and collaboratively, analyze the outcomes, and enhance methodologies for peak performance.

• Explore, troubleshoot, and enhance both model efficiency and computational performance.

• Aid in the evolution of training systems to guarantee seamless scalability and efficiency across target platforms.

⛳️ Requirements

• A degree in Computer Science or a related discipline.

• Preferably a PhD in NLP, Machine Learning, or a comparable field, supported by a robust history in AI R&D (including strong publications in A* conferences).

• Practical experience in contributing to large-scale LLM training operations on extensive, distributed servers with thousands of NVIDIA GPUs.

• Acquainted with and have hands-on experience with large-scale, distributed training frameworks.

• Profound understanding of cutting-edge transformer and non-transformer modifications designed to enhance intelligence, efficiency, and scalability.

• Significant expertise in PyTorch and Hugging Face libraries with practical experience in model development, continual pretraining, and deployment.

🏝️ Benefits

• 100% Remote work.

• Opportunity to collaborate with a global team.

• Access to cutting-edge technology.

• Opportunities for professional development.

AI Research Engineer – Pre training

📋 Description

⛳️ Requirements

🏝️ Benefits

People also viewed

AI Research Engineer, Model Compression – Quantization

Clinical AI Research Lead

AI Research Engineer – Pre-training, LLM, Multi-Modal

ML Researcher

Clinical AI Research Assistant

AI Researcher

Never miss a great job!