Remotery

Site Reliability Engineer

Posted 1 day ago

This is a fully remote position, open to applicants in California.

📋 Description

• Design, develop, and maintain scalable and dependable systems in cloud environments such as Azure Cloud Services.

• Provide operational assistance for comprehensive software applications.

• Enhance system resilience through advanced coding and rigorous release and change management.

• Create service-level indicators and objectives to automate the release validation process.

• Boost automation efforts and enhance the system’s self-healing abilities.

• Gather operating system data and present performance metrics to stakeholders.

• Ensure adherence to security best practices in cloud infrastructure and application deployments.

• Oversee the maintenance of cloud and database systems, addressing production issues as they arise.

• Elevate the reliability, quality, and time-to-market of our suite of software solutions.

• Collaborate with security and product teams to establish and disseminate policies, processes, and playbooks for rapid and effective alert and incident management.

• Direct incident management processes; respond swiftly to outages and service disruptions.


⛳️ Requirements

• Bachelor’s degree in computer science or a related field.

• Five years of experience as a site reliability engineer or in a comparable role.

• Strong programming abilities in languages such as Golang, Ruby, Python, or similar.

• Demonstrated capability to diagnose and monitor performance and reliability challenges across the entire stack.

• Proficiency in Kubernetes.

• Relevant industry certifications, such as those from the Site Reliability Engineering (SRE) Foundation.

• Proven experience with cloud-native infrastructure, including Azure Cloud Services, AWS, or GCP.

• Familiarity with observability and incident management tools, such as Datadog, OpsGenie, or PagerDuty.

• Experience in scripting operating system tasks using Infrastructure as Code.

• Exemplary communication skills.

• Ability to address problems in a fast-paced, high-stakes environment.

• Ability to verify legal right to work in the United States without company sponsorship.


🏝️ Benefits

• Competitive salary and performance-based bonuses.

• Comprehensive health, dental, and vision insurance.

• Opportunities for professional development and continuous learning.

• Flexible work hours and remote work options.

• A collaborative and inclusive work environment.
