
Senior HPC and AI Networking Performance Engineer
Posted May 20

Posted May 20
This is a fully remote position, open to applicants in Germany.
• Conduct research and gain experience with AI workloads and deep learning models specifically designed for large-scale LLM training on NVIDIA supercomputers, emphasizing high-performance networking.
• Perform benchmarking, profiling, and analysis of performance to identify bottlenecks and areas for enhancement and optimization, particularly focusing on networking components.
• Develop and implement performance analysis tools.
• Work collaboratively with various teams, from hardware to software, to deliver insights on performance analysis.
• Establish performance testing strategies, set performance expectations for emerging technologies and solutions, and strive to achieve performance targets.
• Bachelor’s degree in Computer Science or Software Engineering.
• Over 6 years of experience in high-performance networking (RDMA, MPI, NCCL).
• Proven skills and methodologies in performance analysis.
• Familiarity with NVIDIA GPUs, CUDA libraries, and deep learning frameworks such as TensorFlow or PyTorch.
• Quick and self-motivated learner with strong analytical and problem-solving abilities.
• Proficient in programming languages: Python, Bash, and C.
• Experience with Linux operating system distributions.
• Effective team player with strong communication and interpersonal skills.
• NVIDIA is dedicated to promoting a diverse work environment and is proud to be an equal opportunity employer.
EverAI
10x.Team
EverAI
Invisible Technologies
Get handpicked remote jobs straight to your inbox weekly.