Remotery

Staff SRE, Ads

Posted 22 hours ago

This is a fully remote position, open to applicants in the United Kingdom.

📋 Description

• Oversee reliability projects across various Ads domains, including ad serving, auctions, targeting, reporting, measurement, and billing.

• Collaborate with engineering leadership to enhance reliability, scalability, operational excellence, and engineering efficiency throughout the Ads organization.

• Facilitate architecture reviews and influence technical choices affecting critical revenue-generating systems.

• Create and implement platforms, tools, and automation that boost reliability and developer productivity at scale.

• Engage in on-call rotations, lead intricate incident investigations, and coordinate cross-functional response efforts during significant production events.

• Recognize systemic reliability risks and develop long-term solutions to enhance platform resilience.

• Establish reliability metrics centered around advertiser-critical user journeys, which include campaign creation, ad delivery, auction participation, reporting, attribution, and billing.

• Mentor engineers and offer technical leadership across various teams.

• Shape roadmap planning and ensure that reliability factors are integrated into product and infrastructure investments.


⛳️ Requirements

• Over 8 years of experience in Site Reliability Engineering, Infrastructure Engineering, or similar roles managing large-scale distributed systems.

• Significant experience in supporting high-traffic, user-facing production environments.

• Deep knowledge of distributed systems, networking, Linux systems, and cloud-native architectures.

• Proven experience in designing highly available systems with robust operational and reliability practices.

• Strong grasp of observability systems, including metrics, logging, tracing, and alerting.

• Proficiency in a programming language such as Go or Python, or an equivalent.

• Experience in enhancing reliability through SLOs, automation, incident management, and performance optimization.

• Proven ability to troubleshoot complex issues within a modern distributed system stack.

• Excellent collaboration and communication skills, and the ability to influence technical direction across teams.


🏝️ Benefits

• Global Benefit programs that fit your lifestyle, from workspace to professional development to caregiving support.

• Family Planning Support.

• Gender-Affirming Care.

• Mental Health & Coaching Benefits.

• Group Personal Pension Scheme with Employer match.

• Private Medical and Dental Scheme.

• Income Replacement Programs.

• Bike to Work scheme.

• Flexible Vacation & Paid Volunteer Time Off.

• Generous Paid Parental Leave.
