
AI Research Engineer – Pre-training, LLM, Multi-Modal
Posted Jun 11

Posted Jun 11
This is a fully remote position, open to applicants in India.
• Facilitate foundational pre-training for Large Language Models (LLMs) and Multi-Modal models.
• Develop, prototype, and enhance innovative architectures, tokenizers, and alignment layers.
• Acquire, filter, and curate extensive textual and multi-modal datasets.
• Conduct experiments independently and collaboratively, analyzing the outcomes.
• Explore, troubleshoot, and resolve inefficiencies in model performance.
• Play a key role in the progression of distributed training systems.
• A degree in Computer Science or a related discipline.
• PhD in Natural Language Processing (NLP), Machine Learning, or a closely related area is preferred.
• Practical experience in contributing to large-scale LLM or Multi-Modal pre-training initiatives.
• Familiarity and hands-on experience with large-scale, distributed training frameworks.
• In-depth knowledge of cutting-edge transformer and non-transformer modifications.
• Strong proficiency in PyTorch and Hugging Face libraries.
• Opportunities for professional development.
• Flexibility for remote work.
Sophos
NVIDIA
Geomagical Labs
Cotiviti
Get handpicked remote jobs straight to your inbox weekly.