
AI Research Engineer – Pre-training
Posted May 19

This is a fully remote position, open to applicants in Argentina.
• Pre-train AI models on large distributed clusters equipped with thousands of NVIDIA GPUs.
• Create, prototype, and scale innovative architectures to improve model intelligence.
• Conduct experiments both independently and collaboratively, analyze outcomes, and enhance methodologies for maximum performance.
• Explore, troubleshoot, and enhance model efficiency and computational performance.
• Play a key role in advancing training systems to ensure smooth scalability and efficiency on designated platforms.
• A degree in Computer Science or a related discipline.
• Preferably a PhD in NLP, Machine Learning, or a related area.
• Practical experience contributing to large-scale LLM training runs on large distributed clusters with many NVIDIA GPUs.
• Knowledge of large-scale, distributed training frameworks, libraries, and tools.
• In-depth understanding of state-of-the-art modifications to transformer and non-transformer architectures.
• Strong proficiency in PyTorch and Hugging Face libraries, with hands-on experience in model development, continual pretraining, and deployment.
• Flexible working arrangements.
• Opportunities for professional development.
Tether.to