Remotery

Site Reliability Engineer

atAceHack 4.0US flagUnited StatesFull-timeDevOps & Site Reliability Engineer (SRE)Mid-levelSenior$180k – $250k/year

Posted 2 hours ago

📋 Description

• Take ownership of reliability, availability, and performance for production systems operating in cloud environments.

• Establish and monitor SLIs/SLOs while assisting in managing error budgets across the platform.

• Lead incident response initiatives encompassing detection, triage, mitigation, and post-incident reviews.

• Enhance observability through effective logging, monitoring, alerting, and dashboard implementations.

• Automate operational processes and minimize manual tasks wherever feasible.

• Collaborate closely with engineering teams to bolster system resilience and scalability.

• Assist in capacity planning, infrastructure optimization, and performance enhancement.

• Develop internal tools, runbooks, and best practices for operations.

• Provide support for Kubernetes-based infrastructure and large-scale distributed systems.

• Serve as an escalation point for intricate production and platform challenges.


⛳️ Requirements

• Over 5 years of experience in Site Reliability Engineering, DevOps, Platform Engineering, or comparable infrastructure positions.

• Extensive experience with cloud platforms such as AWS, GCP, or Azure.

• Practical experience with Kubernetes and containerized environments.

• Strong grasp of distributed systems and microservices architecture.

• Familiarity with observability tools such as Prometheus, Grafana, Datadog, ELK, or OpenTelemetry.

• Skilled in infrastructure automation and scripting (Terraform, Python, Bash, etc.).

• Experience with managing CI/CD pipelines and automating deployments.

• Excellent troubleshooting and incident management capabilities.

• Ability to collaborate across functions and communicate effectively in high-pressure scenarios.


🏝️ Benefits

• Comprehensive health coverage including medical, dental, and vision.

• Flexible paid time off (PTO).

• Support for personal development.

People also viewed

Launch Potato2 hours ago

Lead DevOps/SRE Engineer

US flagUnited States OnlyFull-timeDevOps & Site Reliability Engineer (SRE)
ApplyView job
Xtremepush2 hours ago

Senior DevOps Engineer, AWS

LT flagLithuania OnlyFull-timeDevOps & Site Reliability Engineer (SRE)
ApplyView job
BI2run2 hours ago

BI DevOps Engineer – m/w/d

DE flagGermany OnlyFull-timeDevOps & Site Reliability Engineer (SRE)€50k – €70k/year
ApplyView job
S + S Regeltechnik GmbH2 hours ago

Team Leader – DevOps

DE flagGermany OnlyFull-timeDevOps & Site Reliability Engineer (SRE)
ApplyView job
NVIDIA2 hours ago

Senior Network Reliability Engineer – DGX Cloud

US flagCalifornia OnlyFull-timeDevOps & Site Reliability Engineer (SRE)$136k – $264.5k/year
ApplyView job
Newfold Digital2 hours ago

Principal Dev Ops Engineer

AR flagArgentina OnlyFull-timeDevOps & Site Reliability Engineer (SRE)
ApplyView job

Never miss a great job!

Get handpicked remote jobs straight to your inbox weekly.

Trusted by 7,400+ designers