
Principal Operations Engineer – Hardware, Data Center Operations
Posted Jun 20

This is a fully remote position, open to applicants in the United States.
• Act as the leading technical authority for the operational hardware fleet within our expansive AI data center portfolio.
• Ensure the effective operation, maintenance, and ongoing enhancement of GPU systems, servers, and related hardware deployed at scale.
• Conduct site assessments and operational audits.
• Drive the technical readiness of teams before site activation.
• Evaluate hardware platforms and integration designs from an operational perspective.
• Relay operational insights back to the hardware engineering, deployment, and supply chain teams as we transition towards a productized, standardized build model.
• Over 10 years of practical experience in managing mission-critical hardware infrastructure, including at least 5 years in a senior technical position on a site, campus, or fleet.
• Experience in data center operations is highly preferred; experience in hyperscale, large HPC, cloud, or other mission-critical computing infrastructure is also considered.
• Extensive knowledge of GPU systems, server platforms, storage infrastructure, firmware lifecycle management, and hardware diagnostics — acquired through hands-on experience rather than theoretical understanding.
• Proven capability to draft, approve, and implement high-risk MOPs and change records in active production environments.
• A successful history of leading root cause analyses on significant hardware incidents and ensuring corrective actions are fully executed.
• Demonstrated experience in holding OEMs, ODMs, service vendors, and deployment partners accountable — adept at enforcing standards while maintaining positive relationships.
• Excellent written communication skills: operational health assessments, RCAs, procedure reviews, and design review feedback come naturally.
• Comfortable operating as the senior technical representative across operations, hardware engineering, networking, facilities, supply chain, and customer-facing teams.
• Readiness to travel extensively across the fleet (50-75% travel).
• Competitive total compensation package (salary + equity).
• Retirement or pension plan, consistent with local standards.
• Health, dental, and vision insurance.
• Generous PTO policy, aligned with local customs.