Remotery

Senior Site Reliability Engineer – SRE

Posted Jun 19

This is a fully remote position, open to applicants in Spain.

📋 Description

• Drive Operational Excellence: Design, implement, and maintain highly available, scalable, and resilient systems that provide an outstanding customer experience.

• Datadog Expert: Serve as one of the key experts for Datadog, responsible for defining and executing best practices.

• Software Development for Reliability: Create robust, well-tested, and maintainable software to automate operational tasks.

• Toil Reduction Champion: Identify and eliminate toil through automation and process enhancements.

• Incident Management & Post-Mortems: Lead blameless post-mortems and contribute to the incident response framework.

• Reliability Metrics & Goals: Collaborate to define, implement, and monitor Service Level Indicators (SLIs), Service Level Objectives (SLOs), and Error Budgets.

• Infrastructure as Code: Utilize and contribute to infrastructure as code initiatives.

• System Design & Architecture: Provide SRE expertise during system design reviews.

• Knowledge Sharing & Mentorship: Document processes and share expertise with the team.


⛳️ Requirements

• Proven experience in operating and enhancing production systems at scale in an SRE, Production Engineering, or Platform Engineering capacity.

• Demonstrated ability to quickly develop accurate mental models of complex distributed systems across infrastructure, applications, networking, identity, and observability domains.

• Strong troubleshooting capabilities with a methodical, evidence-based approach to incident response and root cause analysis.

• Experience in defining, implementing, and utilizing Service Level Indicators (SLIs), Service Level Objectives (SLOs), and error budgets to inform reliability decisions.

• Exceptional written and verbal communication skills, with the capability to articulate complex technical issues clearly to both technical and non-technical audiences.


🏝️ Benefits

• Flexible work arrangements.

• Professional development opportunities.

• Continuous improvement culture.

• Mentorship opportunities.

People also viewed

N2JSoft, administrative and HR softwares1 day ago

DevOps confirmé

FR flagFrance OnlyFull-timeDevOps & Site Reliability Engineer (SRE)€60k/year
ApplyView job
It's Prodigy1 day ago

DevOps Engineer, Cloud

Anywhere in the WorldFull-timeDevOps & Site Reliability Engineer (SRE)
ApplyView job
ARA2 days ago

Senior Site Reliability Engineer

US flagNew Mexico OnlyFull-timeDevOps & Site Reliability Engineer (SRE)
ApplyView job
Kenlo3 days ago

Analista de Infraestrutura, SRE, DevOps

BR flagBrazil OnlyFull-timeDevOps & Site Reliability Engineer (SRE)
ApplyView job
Ad Hoc LLC3 days ago

Senior Site Reliability Engineer

North AmericaFull-timeDevOps & Site Reliability Engineer (SRE)$135k – $150k/year
ApplyView job
Assured4 days ago

Staff Database Reliability Engineer, DBRE

US flagUnited States OnlyFull-timeDevOps & Site Reliability Engineer (SRE)$165k – $185k/year
ApplyView job

Never miss a great job!

Get handpicked remote jobs straight to your inbox weekly.

Trusted by 7,400+ designers