This is a fully remote position, open to applicants in Argentina.

📋 Description

• Run pre-training of AI models on large distributed clusters equipped with thousands of NVIDIA GPUs.

• Create, prototype, and scale innovative architectures to improve model intelligence.

• Conduct experiments both independently and collaboratively, analyze outcomes, and enhance methodologies for maximum performance.

• Explore, troubleshoot, and enhance model efficiency and computational performance.

• Play a key role in advancing training systems to ensure smooth scalability and efficiency on designated platforms.

⛳️ Requirements

• A degree in Computer Science or a related discipline.

• Preferably a PhD in NLP, Machine Learning, or a related area.

• Practical experience contributing to large-scale LLM training processes on extensive, distributed servers equipped with numerous NVIDIA GPUs.

• Knowledge of large-scale, distributed training frameworks, libraries, and tools.

• In-depth understanding of cutting-edge modifications to transformer and non-transformer architectures.

• Strong proficiency in PyTorch and Hugging Face libraries, with hands-on experience in model development, continual pretraining, and deployment.

🏝️ Benefits

• Flexible working arrangements.

• Opportunities for professional development.
