Remotery

Senior IT Site Reliability Manager – f/m/x

Posted May 23

This is a fully remote position, open to applicants in Germany.

📋 Description

• Oversee and manage the daily IT operations and service delivery.

• Ensure optimal system availability, reliability, and performance across all platforms.

• Propel, develop, and expand Site Reliability Engineering (SRE) practices, including monitoring, incident response, and automation.

• Take ownership and enhance 24/7 operational readiness, including on-call models and escalation protocols.

• Work closely with development teams in agile frameworks (e.g., SAFe) to bolster system resilience and scalability.

• Continuously identify and implement enhancements based on incident analysis, key performance indicators (KPIs), and operational insights.

• Align operations with ITIL processes (Critical Incident, Incident, Problem, Change Management).

• Manage and optimize cloud infrastructure, primarily within AWS environments.

• Serve as a liaison between operations, engineering, and business stakeholders.


⛳️ Requirements

• Extensive experience at a senior level in IT operations, Site Reliability Engineering, or a similar role.

• Strong proficiency in ITIL-based service management and operational best practices.

• Practical experience with AWS cloud systems.

• Familiarity with working in agile environments, preferably SAFe.

• Comprehensive understanding of IT operations requirements, processes, and methodologies.

• Excellent analytical, problem-solving, and decision-making abilities.

• Innovative mindset with enthusiasm for exploring and adopting new technologies (early adopter mentality).

• Outstanding interpersonal skills with a high degree of empathy and the capacity to engage effectively with individuals across the organization.

• Strong team player with a collaborative and proactive attitude.

• Ability to function effectively in a 24/7 environment.

• Degree in Computer Science, Information Technology, or a related discipline (or equivalent experience).

• ITIL certification is advantageous.

• Experience with monitoring, automation, and DevOps methodologies and culture is highly desirable.

• Proficiency in English at C1 level or higher (required).


🏝️ Benefits

• Flexible work arrangements.

• Opportunities for professional development.

People also viewed

Advanced Solutions International, Inc.10 hours ago

DevOps Reliability Engineer

AU flagAustralia OnlyFull-timeDevOps & Site Reliability Engineer (SRE)$90k – $110k/year
ApplyView job
Stone10 hours ago

Senior Site Reliability Engineer – Network

BR flagBrazil OnlyFull-timeDevOps & Site Reliability Engineer (SRE)
ApplyView job
Replit1 day ago

Staff Site Reliability Engineer

EuropeFull-timeDevOps & Site Reliability Engineer (SRE)
ApplyView job
Soum1 day ago

DevOps Engineer, Mid Level

EG flagEgypt OnlyFull-timeDevOps & Site Reliability Engineer (SRE)
ApplyView job
Lakeside Software1 day ago

DevOps Engineer, Azure

IN flagIndia OnlyFull-timeDevOps & Site Reliability Engineer (SRE)
ApplyView job
Interval Group1 day ago

DevOps Engineer, mk8s

DE flagGermany OnlyFreelanceDevOps & Site Reliability Engineer (SRE)
ApplyView job

Never miss a great job!

Get handpicked remote jobs straight to your inbox weekly.

Trusted by 7,400+ designers