
Staff SRE, Ads
Posted 6 days ago

Posted 6 days ago
This is a fully remote position, open to applicants in Ireland.
• Spearhead reliability initiatives across various Ads domains, including ad serving, auctions, targeting, reporting, measurement, and billing.
• Collaborate with engineering leadership to enhance reliability, scalability, operational excellence, and engineering efficiency within the Ads organization.
• Facilitate architecture reviews and influence technical decisions that affect critical revenue-generating systems.
• Create and develop platforms, tools, and automation that enhance reliability and boost developer productivity at scale.
• Engage in on-call rotations, lead intricate incident investigations, and coordinate cross-functional response efforts during significant production events.
• Recognize systemic reliability risks and implement long-term solutions that strengthen platform resilience.
• Establish reliability metrics related to advertiser-critical user journeys, such as campaign creation, ad delivery, auction participation, reporting, attribution, and billing.
• Mentor engineers and offer technical leadership across various teams.
• Shape roadmap planning and ensure that reliability considerations are integrated into product and infrastructure investments.
• Over 8 years of experience in Site Reliability Engineering, Infrastructure Engineering, or similar roles managing large-scale distributed systems.
• Extensive experience maintaining high-traffic, user-facing production environments.
• Profound understanding of distributed systems, networking, Linux systems, and cloud-native architectures.
• Proven experience in designing highly available systems with robust operational and reliability practices.
• Comprehensive knowledge of observability systems, including metrics, logging, tracing, and alerting.
• Proficient programming skills in languages such as Go, Python, or similar.
• Experience enhancing reliability through SLOs, automation, incident management, and performance optimization.
• Proven ability to troubleshoot complex issues across a contemporary distributed system stack.
• Excellent collaboration and communication skills, with the ability to influence technical direction across teams.
• Global Benefit programs that fit your lifestyle, from workspace to professional development to caregiving support
• Family Planning Support
• Gender-Affirming Care
• Mental Health & Coaching Benefits
• Private Medical, Dental, and Vision Benefits
• Personal Retirement Savings Account with matching contribution
• Cycle to Work and Tax Saver schemes
• Flexible Vacation & Paid Volunteer Time Off
• Generous Paid Parental Leave
Investigo
Software Mind
Cherokee Federal
Avaya
Get handpicked remote jobs straight to your inbox weekly.