Remotery

Staff Site Reliability Engineer

Posted 22 hours ago

📋 Description

• Design Reliable Frameworks: Create frameworks and self-service tools that empower teams to maintain the reliability of their services within a “You Build It, You Run It” ethos.

• Lead AI-Powered Reliability Initiatives: Spearhead our AIOps strategy by automating diagnostics and remediation and by taking proactive measures to prevent failures.

• Advocate for a Reliability-Driven Culture: Integrate SRE practices throughout engineering by conducting design reviews, ensuring production readiness, and establishing operational standards.

• Incident Command Leadership: Serve as Incident Commander during critical incidents, exemplifying operational excellence and ensuring that blameless postmortems contribute to meaningful improvements.

• Enhance Observability: Implement comprehensive monitoring, tracing, and profiling (using tools such as Prometheus, Grafana, OpenTelemetry, and continuous profiling) to proactively optimize performance.

• Mentor and Amplify: Foster the growth of engineers within SRE and product teams through mentorship, technical support, and knowledge sharing.


⛳️ Requirements

• Minimum of 8 years of experience in Site Reliability Engineering, DevOps, or a related field, including at least 3 years in a Senior+ SRE role.

• Solid experience in managing production SaaS systems at scale.

• Proficiency in at least one programming or scripting language (such as Python, Go, or similar).

• Practical knowledge of cloud services (AWS, GCP, or Azure) and Kubernetes.

• Comprehensive understanding of networking principles (TCP/IP, DNS, HTTP/HTTPS, load balancing).

• Familiarity with monitoring and alerting tools (Prometheus, Grafana, Datadog, ELK).

• Knowledge of advanced observability concepts (OpenTelemetry, continuous profiling).

• Demonstrated experience in incident management, including leading high-severity incidents and conducting postmortems.

• Strong troubleshooting abilities across the entire stack.

• Exceptional communication and collaboration skills.


🏝️ Benefits

• AlphaSense is committed to being an equal-opportunity employer.

• Reasonable accommodations are available for qualified employees with protected disabilities.
