
Director, Operations SME – Data Centers – Critical Environment
Posted May 9

This is a fully remote position, open to applicants in Texas.
• Assist in the development of solutions for managing clients' data center portfolios.
• Collaborate with technical transition leadership to ensure that thorough due diligence and site startup are implemented as planned.
• Serve as the primary subject matter expert (SME) focused on critical facility operations, including electrical distribution, UPS systems, generators, cooling systems (CRAC/CRAH), and BMS/EPMS platforms.
• Possess a profound understanding of high-density data center layouts and advanced liquid cooling technologies, including operations and maintenance.
• Provide expert advice on incident response, root cause analysis (RCA), and risk mitigation strategies.
• Review and authorize MOPs, SOPs, and EOPs, ensuring they are technically accurate and in line with industry best practices.
• Lead technical assessments, audits, and operational readiness evaluations across data center locations.
• Serve as the escalation authority for complex troubleshooting and high-severity incidents (SEVs).
• Implement and advise on training protocols for site teams.
• Drive the deployment of operational platforms for managing maintenance and operations of critical data center sites.
• Maintain ongoing collaboration with Account Directors and clients to ensure operations align with business objectives, SLAs, and reliability targets.
• Ensure a comprehensive, well-planned handover to the ongoing account management team for operational continuity.
• Promote standardization and scalability of operating models across multiple sites or global accounts.
• Identify operational deficiencies and lead corrective action plans along with performance improvement initiatives.
• Act as a strategic advisor during contract solutioning, transitions, mobilizations, and expansions.
• Advocate for best practices in uptime, redundancy, and resilience across all data center environments.
• Encourage the adoption of reliability-centered maintenance (RCM), predictive maintenance, and condition-based monitoring.
• Analyze performance metrics to identify trends and proactively address risks.
• Lead initiatives aimed at enhancing energy efficiency, sustainability, and cost optimization.
• Ensure compliance with regulatory and industry standards (OSHA, NFPA, ISO, Uptime Institute guidelines, etc.).
• Assist in audit preparation, compliance reviews, and risk management frameworks.
• Develop and implement standardized operational controls and governance models.
• Assess and mitigate risks linked to critical infrastructure changes or failures.
• Act as a trusted technical advisor to clients and account stakeholders, including data center engineering and operations leadership.
• Participate in executive governance meetings (QBRs, technical reviews).
• Effectively communicate complex technical issues and solutions in a clear, business-oriented manner.
• Provide thought leadership and innovative recommendations to enhance client outcomes.
• Mentor and cultivate operations leaders, engineers, and technical teams.
• Establish competency frameworks and training programs for critical operations roles.
• Foster a culture of safety, accountability, and continuous improvement.
• Contribute to talent strategy and succession planning for technical roles.
• Bachelor’s degree in Engineering (Electrical, Mechanical, or related field) or equivalent experience.
• 12+ years of experience in data center operations, critical facilities, or mission-critical environments.
• 8+ years in senior leadership, technical advisory, or SME roles.
• Extensive knowledge of: electrical systems (medium/low voltage, switchgear, UPS, generators, etc.); mechanical systems (HVAC, chilled water, cooling technologies, fire systems, etc.); high-density liquid cooling systems and configurations; and operations and redundancies in critical environments.
• Proven track record supporting large-scale enterprise, hyperscale, or colocation environments.
• Strong experience with incident management, RCA methodologies, and risk mitigation strategies.
• Health insurance
• Vision insurance
• Dental insurance
• Flexible spending accounts
• Health savings accounts
• Retirement savings plans
• Life and disability insurance programs
• Paid and unpaid time away from work