
Senior Performance & Infrastructure Engineer - HPC
Posted May 20

Posted May 20
This is a fully remote position, open to applicants in Netherlands.
• Analyze, profile, fine-tune, and enhance the Linux kernel and its subsystems (CPU scheduling, memory management, networking stack) specifically for GPU clusters and InfiniBand technologies.
• Diagnose and rectify intricate performance issues.
• Incorporate and validate new GPU hardware and infrastructure (KVM/QEMU, PCIe devices, Kubernetes).
• Enhance monitoring, alerting, and automation processes for extensive, distributed systems.
• Occasionally support clients in optimizing their workloads.
• Strong understanding of Linux internals, with experience in kernel tracing, profiling, and tuning (e.g., perf, ftrace, eBPF, sysctl, kgdb, etc.).
• Exceptional programming capabilities in C or C++ for system-level development, along with a solid understanding of data structures and algorithms.
• Background in performance optimization (e.g., high-load/high-throughput, low-latency, low-jitter, memory bypasses, zero-copy, lock-free techniques, synchronization across large-scale clusters, etc.).
• Proficiency in scripting or development using Go, Python, or equivalent languages.
• Flexible working arrangements.
• A dynamic and collaborative work atmosphere that promotes initiative and innovation.
Pagefreezer
Orro Group
Feldera
Webflow
Get handpicked remote jobs straight to your inbox weekly.