
Senior GPU Infrastructure Engineer
Posted 11 hours ago

This is a fully remote position, open to applicants in California.
• Assist in the development and expansion of Hyperbolic's GPU Cloud Marketplace.
• Create a solution for multi-tenancy provisioning and virtualization.
• Convert raw GPUs sourced from various global suppliers into a programmable and orchestrated resource pool.
• Provide services to thousands of AI developers and researchers.
• Work at the forefront of cloud infrastructure technology.
• Develop the essential orchestration layer that allows the platform to achieve up to 75% cost savings compared to conventional cloud service providers.
• Comprehensive knowledge of bare-metal provisioning and lifecycle management, encompassing IPMI/Redfish, BMC-based remote management, PXE boot, and automated operating system deployment workflows.
• In-depth understanding of GPU scheduling and orchestration, including awareness of GPU types, memory management, topology considerations, placement strategies for multi-GPU tasks, and minimizing fragmentation.
• Strong skills in infrastructure and DevOps engineering with expertise in Terraform or Pulumi, CI/CD for infrastructure, secrets management, configuration management, and implementation of observability stacks.
• Experience with storage and data infrastructure tailored for AI/ML workloads, such as object storage, high-IOPS block storage, and distributed file systems for training data and checkpoints.
• Proficiency in API design and cloud-init for automated provisioning and configuration tasks.
• Solid grasp of GPU architecture, CUDA, and GPU compute optimization techniques.
• A highly collaborative team player with outstanding communication skills that bridge technical and non-technical stakeholders.
• Proven ability to work effectively with hardware vendors and their engineering teams to troubleshoot problems and improve integrations.
• Experience in building and scaling cloud infrastructure or distributed systems within production settings.
• Competitive salary and performance-based bonuses.
• Opportunities for professional development and career advancement.
• Flexible work hours and remote work options.
• Health, dental, and vision insurance plans.
• A supportive and inclusive work environment.