
Engineering Manager – Infrastructure
Posted 6 days ago

Posted 6 days ago
This is a fully remote position, open to applicants in Belgium.
• Lead, mentor, and develop a team of six infrastructure engineers.
• Drive initiatives focused on infrastructure scalability, resilience, and operational excellence.
• Maintain active technical involvement in architecture, debugging, automation, and incident management.
• Oversee and manage the team’s on-call schedules and incident response protocols.
• Define and track operational KPIs, SLIs, and SLOs to ensure infrastructure reliability and performance.
• Develop and enhance financial governance related to infrastructure and cloud expenditures.
• Propel initiatives aimed at achieving zero-downtime deployments and ensuring highly available distributed systems.
• Enhance observability, monitoring, alerting, and incident management methodologies.
• Collaborate closely with Product and Engineering teams to meet platform scalability and performance objectives.
• Lead post-mortem reviews and root cause analyses to continuously enhance platform reliability.
• Cultivate a robust engineering culture that emphasizes ownership, collaboration, automation, and continuous improvement.
• Contribute to the long-term infrastructure and platform strategy.
• Over six years of experience in Software, Infrastructure, Platform Engineering, or SRE roles, including management responsibilities.
• Demonstrated success in leading and mentoring engineering teams in high-scale environments.
• Extensive experience in designing and operating scalable distributed systems.
• Practical experience with high availability, disaster recovery, incident management, and zero-downtime strategies.
• Deep expertise in AWS, Kubernetes, Docker, and cloud-native architectures.
• Strong engineering background in Kotlin or Java ecosystems.
• Experience in implementing monitoring, alerting, logging, and observability best practices.
• Skilled in defining and optimizing infrastructure cost KPIs and managing cloud spending governance.
• Experience managing on-call rotations, incident response, and operational processes.
• Excellent interpersonal and stakeholder management skills, with the capacity to align technical initiatives with business objectives.
• Passionate about fostering the technical and professional growth of engineers.
• Strong analytical mindset with the ability to make pragmatic technical and operational decisions in dynamic environments.
• Fluent in English (CEFR C1 or C2); additional European languages are a bonus.
• Remote-first culture.
• In-person meetings at least once a quarter.
• Opportunities for professional growth.
refurbed
Atlan Stormwater
Hint Health
Trust Wallet
Get handpicked remote jobs straight to your inbox weekly.