
Director, Operations SME – Data Centers – Critical Environment
Posted 1 hour ago

Posted 1 hour ago
This is a fully remote position, open to applicants in Texas.
• Assist in the creation of solutions for managing clients' data center portfolios.
• Collaborate with technical transition leadership to ensure thorough due diligence and site start-up are executed as planned.
• Serve as the primary subject matter expert focused on critical facility operations, including electrical distribution, UPS systems, generators, cooling systems (CRAC/CRAH), and BMS/EPMS platforms.
• Possess a comprehensive understanding of high-density data center layouts and advanced liquid cooling technologies, including operations and maintenance.
• Offer expert advice on incident response, root cause analysis (RCA), and risk mitigation strategies.
• Review and authorize MOPs, SOPs, and EOPs, ensuring technical precision and compliance with industry best practices.
• Lead technical assessments, audits, and operational readiness evaluations across data center sites.
• Assist with complex troubleshooting and high-severity incidents (SEVs) as the escalation authority.
• Implement and advise on training protocols for site teams.
• Facilitate the deployment of operational platforms for managing maintenance and operations of critical data center locations.
• Maintain ongoing collaboration with Account Directors and clients to ensure operational alignment with business objectives, SLAs, and reliability goals.
• Ensure a comprehensive and well-structured handover to the ongoing account management team for operational continuity.
• Promote standardization and scalability of operating models across multi-site or global accounts.
• Identify operational deficiencies and spearhead corrective action plans and performance improvement initiatives.
• Act as a strategic advisor during contract solutioning, transitions, mobilizations, and expansions.
• Advocate for best practices in uptime, redundancy, and resilience across all data center environments.
• Encourage the adoption of reliability-centered maintenance (RCM), predictive maintenance, and condition-based monitoring.
• Analyze performance data to identify trends and proactively address risks.
• Lead initiatives aimed at enhancing energy efficiency, sustainability, and cost optimization.
• Ensure compliance with regulatory and industry standards (OSHA, NFPA, ISO, Uptime Institute guidelines, etc.).
• Assist in audit preparation, compliance reviews, and risk management frameworks.
• Develop and implement standardized operational controls and governance models.
• Assess and mitigate risks linked to critical infrastructure changes or failures.
• Function as a trusted technical advisor to clients and account stakeholders, including data center engineering and operations leadership.
• Participate in executive governance meetings (QBRs, technical reviews).
• Articulate complex technical issues and solutions in a clear, business-oriented manner.
• Provide thought leadership and innovation suggestions to enhance client outcomes.
• Mentor and develop operations leaders, engineers, and technical teams.
• Establish competency frameworks and training programs for critical operations roles.
• Foster a culture of safety, accountability, and continuous improvement.
• Support talent strategy and succession planning for technical roles.
• Bachelor’s degree in Engineering (Electrical, Mechanical, or a related field) or equivalent experience.
• Over 12 years of experience in data center operations, critical facilities, or mission-critical environments.
• At least 8 years in senior leadership, technical advisory, or subject matter expert roles.
• In-depth knowledge of: Electrical systems (medium/low voltage, switchgear, UPS, generators, etc.).
• Mechanical systems (HVAC, chilled water, cooling technologies, fire, etc.).
• High-density liquid cooling systems and configurations.
• Operations and redundancies in critical environments.
• Proven experience in supporting large-scale enterprise, hyperscale, or colocation environments.
• Strong background in incident management, RCA methodologies, and risk mitigation.
• Health insurance.
• Vision insurance.
• Dental insurance.
• Flexible spending accounts.
• Health savings accounts.
• Retirement savings plans.
• Life and disability insurance programs.
• Paid and unpaid time away from work.
Telefónica Tech
RTX
Arctic Wolf
DaVita Kidney Care
Get handpicked remote jobs straight to your inbox weekly.