
Staff Site Reliability Engineer
Posted 22 hours ago

Posted 22 hours ago
• Design Reliable Frameworks: Create frameworks and self-service tools that empower teams to maintain the reliability of their services within a “You Build It, You Run It” ethos.
• Lead AI-Powered Reliability Initiatives: Spearhead our AIOps strategy — automating diagnostics, remediation, and proactive measures for failure prevention.
• Advocate for a Reliability-Driven Culture: Integrate SRE practices throughout engineering by conducting design reviews, ensuring production readiness, and establishing operational standards.
• Incident Command Leadership: Serve as Incident Commander during critical incidents, exemplifying operational excellence and ensuring that blameless postmortems contribute to meaningful improvements.
• Enhance Observability: Implement comprehensive monitoring, tracing, and profiling (using tools like Prometheus, Grafana, OTEL, and Continuous Profiling) to proactively optimize performance.
• Mentor and Amplify: Foster the growth of engineers within SRE and product teams through mentorship, technical support, and knowledge sharing.
• Minimum of 8 years of experience in Site Reliability Engineering, DevOps, or a related field, including at least 3 years in a Senior+ SRE role.
• Solid experience in managing production SaaS systems at scale.
• Proficient in at least one programming or scripting language (such as Python, Go, or similar).
• Practical knowledge of cloud services (AWS, GCP, or Azure) and Kubernetes.
• Comprehensive understanding of networking principles (TCP/IP, DNS, HTTP/S, load balancing).
• Familiarity with monitoring and alerting tools (Prometheus, Grafana, Datadog, ELK).
• Knowledge of advanced observability concepts (OTEL, continuous profiling).
• Demonstrated experience in incident management, including leading high-severity incidents and conducting postmortems.
• Strong troubleshooting abilities across the entire stack.
• Exceptional communication and collaboration skills.
• AlphaSense is committed to being an equal-opportunity employer.
• Reasonable accommodations are available for qualified employees with protected disabilities.
Arctiq
Arctiq
Software Mind
Mediastream
Get handpicked remote jobs straight to your inbox weekly.