
Senior IT Site Reliability Manager – f/m/x
Posted May 23

Posted May 23
This is a fully remote position, open to applicants in Germany.
• Oversee and manage the daily IT operations and service delivery.
• Ensure optimal system availability, reliability, and performance across all platforms.
• Propel, develop, and expand Site Reliability Engineering (SRE) practices, including monitoring, incident response, and automation.
• Take ownership and enhance 24/7 operational readiness, including on-call models and escalation protocols.
• Work closely with development teams in agile frameworks (e.g., SAFe) to bolster system resilience and scalability.
• Continuously identify and implement enhancements based on incident analysis, key performance indicators (KPIs), and operational insights.
• Align operations with ITIL processes (Critical Incident, Incident, Problem, Change Management).
• Manage and optimize cloud infrastructure, primarily within AWS environments.
• Serve as a liaison between operations, engineering, and business stakeholders.
• Extensive experience at a senior level in IT operations, Site Reliability Engineering, or a similar role.
• Strong proficiency in ITIL-based service management and operational best practices.
• Practical experience with AWS cloud systems.
• Familiarity with working in agile environments, preferably SAFe.
• Comprehensive understanding of IT operations requirements, processes, and methodologies.
• Excellent analytical, problem-solving, and decision-making abilities.
• Innovative mindset with enthusiasm for exploring and adopting new technologies (early adopter mentality).
• Outstanding interpersonal skills with a high degree of empathy and the capacity to engage effectively with individuals across the organization.
• Strong team player with a collaborative and proactive attitude.
• Ability to function effectively in a 24/7 environment.
• Degree in Computer Science, Information Technology, or a related discipline (or equivalent experience).
• ITIL certification is advantageous.
• Experience with monitoring, automation, and DevOps methodologies and culture is highly desirable.
• Proficiency in English at C1 level or higher (required).
• Flexible work arrangements.
• Opportunities for professional development.
Advanced Solutions International, Inc.
Stone
Replit
Soum
Get handpicked remote jobs straight to your inbox weekly.