This is a fully remote position, open to applicants in India.

📋 Description

• Execute pre-training of AI models on extensive, distributed servers outfitted with thousands of NVIDIA GPUs.

• Create, prototype, and scale innovative architectures to improve model intelligence.

• Conduct experiments independently and collaboratively, analyze findings, and refine methodologies for peak performance.

• Explore, troubleshoot, and enhance both model efficiency and computational performance.

• Contribute to the evolution of training systems to guarantee smooth scalability and efficiency on designated platforms.

⛳️ Requirements

• A degree in Computer Science or a related discipline.

• Preferably a PhD in NLP, Machine Learning, or a related area, backed by a strong record in AI R&D (including notable publications in A* conferences).

• Practical experience in contributing to large-scale LLM training initiatives on extensive, distributed servers equipped with thousands of NVIDIA GPUs, ensuring scalability and significant improvements in model performance.

• Knowledge and hands-on experience with large-scale, distributed training frameworks, libraries, and tools.

• Profound understanding of cutting-edge transformer and non-transformer modifications aimed at boosting intelligence, efficiency, and scalability.

• Extensive expertise in PyTorch and Hugging Face libraries, with hands-on experience in model development, continual pretraining, and deployment.

🏝️ Benefits

• Flexible work arrangements.

• Professional development opportunities.

AI Research Engineer – Pre training

📋 Description

⛳️ Requirements

🏝️ Benefits

People also viewed

AI Research Engineer, Model Compression – Quantization

Clinical AI Research Lead

AI Research Engineer – Pre-training, LLM, Multi-Modal

Clinical AI Research Assistant

ML Researcher

AI Researcher

Never miss a great job!