Remotery

Staff Site Reliability Engineer

Posted May 13

This is a fully remote position, open to applicants in India.

📋 Description

• Design Reliable Frameworks: Create frameworks and self-service tools that empower teams to maintain the reliability of their services within a “You Build It, You Run It” ethos.

• Lead AI-Powered Reliability Initiatives: Spearhead our AIOps strategy — automating diagnostics, remediation, and proactive measures for failure prevention.

• Advocate for a Reliability-Driven Culture: Integrate SRE practices throughout engineering by conducting design reviews, ensuring production readiness, and establishing operational standards.

• Incident Command Leadership: Serve as Incident Commander during critical incidents, exemplifying operational excellence and ensuring that blameless postmortems contribute to meaningful improvements.

• Enhance Observability: Implement comprehensive monitoring, tracing, and profiling (using tools like Prometheus, Grafana, OTEL, and Continuous Profiling) to proactively optimize performance.

• Mentor and Amplify: Foster the growth of engineers within SRE and product teams through mentorship, technical support, and knowledge sharing.


⛳️ Requirements

• Minimum of 8 years of experience in Site Reliability Engineering, DevOps, or a related field, including at least 3 years in a Senior+ SRE role.

• Solid experience in managing production SaaS systems at scale.

• Proficient in at least one programming or scripting language (such as Python, Go, or similar).

• Practical knowledge of cloud services (AWS, GCP, or Azure) and Kubernetes.

• Comprehensive understanding of networking principles (TCP/IP, DNS, HTTP/S, load balancing).

• Familiarity with monitoring and alerting tools (Prometheus, Grafana, Datadog, ELK).

• Knowledge of advanced observability concepts (OTEL, continuous profiling).

• Demonstrated experience in incident management, including leading high-severity incidents and conducting postmortems.

• Strong troubleshooting abilities across the entire stack.

• Exceptional communication and collaboration skills.


🏝️ Benefits

• AlphaSense is committed to being an equal-opportunity employer.

• Reasonable accommodations are available for qualified employees with protected disabilities.

People also viewed

N2JSoft, administrative and HR softwares1 day ago

DevOps confirmé

FR flagFrance OnlyFull-timeDevOps & Site Reliability Engineer (SRE)€60k/year
ApplyView job
It's Prodigy1 day ago

DevOps Engineer, Cloud

Anywhere in the WorldFull-timeDevOps & Site Reliability Engineer (SRE)
ApplyView job
ARA2 days ago

Senior Site Reliability Engineer

US flagNew Mexico OnlyFull-timeDevOps & Site Reliability Engineer (SRE)
ApplyView job
Kenlo3 days ago

Analista de Infraestrutura, SRE, DevOps

BR flagBrazil OnlyFull-timeDevOps & Site Reliability Engineer (SRE)
ApplyView job
Ad Hoc LLC3 days ago

Senior Site Reliability Engineer

North AmericaFull-timeDevOps & Site Reliability Engineer (SRE)$135k – $150k/year
ApplyView job
Assured4 days ago

Staff Database Reliability Engineer, DBRE

US flagUnited States OnlyFull-timeDevOps & Site Reliability Engineer (SRE)$165k – $185k/year
ApplyView job

Never miss a great job!

Get handpicked remote jobs straight to your inbox weekly.

Trusted by 7,400+ designers