Remotery

Senior Site Reliability Engineer

Posted May 9

This is a fully remote position, open to applicants in Australia.

📋 Description

• Enhancing production reliability and system resilience within a team focused on Site Reliability Engineering (SRE)

• Advocating for high-quality work and adherence to industry best practices

• Engaging with teams and stakeholders throughout all project phases

• Introducing innovative ideas and fostering a culture of creativity

• Addressing complex technical challenges with a proactive attitude

• Collaborating across various technologies in a rapidly evolving industry

• Participating in on-call rotations, managing incident responses, and conducting blameless post-incident reviews

• Writing code, responding to alerts, enhancing solutions, and providing support to team members

• Contributing significantly to the success of both your team and the company


⛳️ Requirements

• Over 5 years of experience managing Linux systems and associated infrastructure within production settings

• A collaborative mindset typical of SRE professionals, with knowledge of SLIs, SLOs, SLAs, error budgets, blast radius, and blameless postmortems

• A commitment to automation, reducing repetitive tasks, and minimizing problem recurrence

• Proven experience in creating runbooks that benefit the entire team, not just individual users

• Solid understanding of Kubernetes and its broader ecosystem

• Experience with cloud infrastructure, preferably AWS; familiarity with bare-metal setups is a plus

• Proficiency in tool development using Bash, and either Python or Go, or similar languages

• Familiarity with Infrastructure-as-Code tools, with a preference for Terraform

• Experience with CI/CD processes and version control, preferably GitHub

• Database expertise in one of the following: Postgres, Cassandra, or ClickHouse

• Experience managing a production observability stack (metrics, logs, traces), with a focus on extracting meaningful insights

• Comfortable working with live production infrastructure, exhibiting strong troubleshooting skills and ownership during incident responses

• A history of ongoing professional development

• A self-motivated approach suited for an asynchronous, globally distributed team, with a readiness to take on additional tasks as needed


🏝️ Benefits

• Flexible working arrangements

• Birthday leave

• Generous funding for study and training, along with 5 days of paid study leave

• Creative, enjoyable, and modern work environments

• A driven team of industry professionals alongside emerging talent

• Recognized achievements through ‘Legend’ and ‘Kudos’ awards

• Comprehensive health and wellness programs

People also viewed

Investigo10 hours ago

Senior Cloud - Kubernetes SRE

GB flagUnited Kingdom OnlyFull-timeDevOps & Site Reliability Engineer (SRE)
ApplyView job
Software Mind10 hours ago

DevOps Engineer

AR flagArgentina OnlyFull-timeDevOps & Site Reliability Engineer (SRE)
ApplyView job
Cherokee Federal10 hours ago

DevSecOps Engineer

US flagUnited States OnlyFull-timeDevOps & Site Reliability Engineer (SRE)$125k – $140k/year
ApplyView job
Avaya10 hours ago

Site Reliability Engineer – Azure, DevSecOps, IaC, Governance, Observability

US flagUnited States OnlyFull-timeDevOps & Site Reliability Engineer (SRE)$129k – $143k/year
ApplyView job
Agilent Technologies10 hours ago

DevOps Engineer – Platform, AWS, CI/CD

US flagColorado OnlyFull-timeDevOps & Site Reliability Engineer (SRE)$143.8k – $224.6k/year
ApplyView job
Dropbox10 hours ago

Site Reliability Engineer

PL flagPoland OnlyFull-timeDevOps & Site Reliability Engineer (SRE)
ApplyView job

Never miss a great job!

Get handpicked remote jobs straight to your inbox weekly.

Trusted by 7,400+ designers