
Staff SRE, Ads
Posted 22 hours ago

Posted 22 hours ago
This is a fully remote position, open to applicants in United Kingdom.
• Oversee reliability projects across various Ads domains, including ad serving, auctions, targeting, reporting, measurement, and billing.
• Collaborate with engineering leadership to enhance reliability, scalability, operational excellence, and engineering efficiency throughout the Ads organization.
• Facilitate architecture reviews and influence technical choices affecting critical revenue-generating systems.
• Create and implement platforms, tools, and automation that boost reliability and developer productivity at scale.
• Engage in on-call rotations, lead intricate incident investigations, and coordinate cross-functional response efforts during significant production events.
• Recognize systemic reliability risks and develop long-term solutions to enhance platform resilience.
• Establish reliability metrics centered around advertiser-critical user journeys, which include campaign creation, ad delivery, auction participation, reporting, attribution, and billing.
• Mentor engineers and offer technical leadership across various teams.
• Shape roadmap planning and ensure that reliability factors are integrated into product and infrastructure investments.
• Over 8 years of experience in Site Reliability Engineering, Infrastructure Engineering, or similar roles managing large-scale distributed systems.
• Significant experience in supporting high-traffic, user-facing production environments.
• Profound knowledge of distributed systems, networking, Linux systems, and cloud-native architectures.
• Proven experience in designing highly available systems with robust operational and reliability practices.
• Strong grasp of observability systems, including metrics, logging, tracing, and alerting.
• Proficient programming skills in languages such as Go, Python, or equivalent.
• Experience in enhancing reliability through SLOs, automation, incident management, and performance optimization.
• Proven ability to troubleshoot complex issues within a modern distributed system stack.
• Excellent collaboration and communication skills with the capacity to influence technical direction across teams.
• Global Benefit programs that fit your lifestyle, from workspace to professional development to caregiving support.
• Family Planning Support.
• Gender-Affirming Care.
• Mental Health & Coaching Benefits.
• Group Personal Pension Scheme with Employer match.
• Private Medical and Dental Scheme.
• Income Replacement Programs.
• Bike to Work scheme.
• Flexible Vacation & Paid Volunteer Time Off.
• Generous Paid Parental Leave.
Investigo
Software Mind
Cherokee Federal
Avaya
Get handpicked remote jobs straight to your inbox weekly.