Remotery

Principal Site Reliability Engineer

Posted May 22

This is a fully remote position, open to applicants in South Africa.

📋 Description

• Design and develop cutting-edge cloud-native infrastructure.

• Lead technical discussions with clients and formulate technical roadmaps.

• Collaborate with the Engineering Director(s) to (re)architect systems.

• Assist the Site Reliability Manager in resource allocation planning.

• Support engineering managers in creating career development paths for those aiming to advance to Principal Engineer roles.

• Teach, mentor, and provide guidance to domain experts, individual contributors, and multiple teams.

• Document workflows and track performance metrics.

• Facilitate discussions to eliminate obstacles and promote teamwork across departments.

• Continuously enhance the stability, scalability, security, cost-efficiency, and operational excellence of our clients' systems.

• Proactively discover, assess, and adopt new technologies to optimize development efficiency and security.

• Conduct planning, testing, and development of infrastructure.

• Provide technical leadership across various projects.


⛳️ Requirements

• Minimum of 7 years of experience in a DevOps/SRE environment.

• Extensive background in DevOps/SRE, team leadership, and collaboration.

• In-depth knowledge of data encryption best practices and cybersecurity.

• Advanced understanding of the DevOps/SRE ecosystem, architectures, and emerging technologies.

• Experience with cloud platforms, preferably GCP, Azure, and AWS.

• Familiarity with Observability Practices and Incident Management.

• Extensive expertise with Prometheus, Grafana, the Elastic Stack, and all versions of Beats, particularly in Kubernetes contexts.

• Proficient in Infrastructure as Code, preferably using Terraform.

• Experience with general automation and configuration management, ideally Ansible.

• Significant experience in building and maintaining Kubernetes clusters and workloads.

• Strong foundational knowledge of basic networking and security principles.

• Ability to create robust CI/CD pipelines.

• Familiarity with both relational and non-relational databases.

• Solid understanding of Linux operating systems.


🏝️ Benefits

• Flexibility and the opportunity to work remotely.

• Work-life balance, with no expectation to work weekends or after hours.

• A progressive remote company that values team connectivity through virtual social platforms for employee engagement.

• A monthly allowance for remote work setup to ensure you are comfortable at home.

• A MacBook or Windows laptop to facilitate your best work.

• Join a team of exceptionally skilled and talented individuals who are eager to share knowledge and experiences.

• We support your career development and celebrate your achievements and advancements!

People also viewed

Advanced Solutions International, Inc.10 hours ago

DevOps Reliability Engineer

AU flagAustralia OnlyFull-timeDevOps & Site Reliability Engineer (SRE)$90k – $110k/year
ApplyView job
Stone10 hours ago

Senior Site Reliability Engineer – Network

BR flagBrazil OnlyFull-timeDevOps & Site Reliability Engineer (SRE)
ApplyView job
Replit1 day ago

Staff Site Reliability Engineer

EuropeFull-timeDevOps & Site Reliability Engineer (SRE)
ApplyView job
Soum1 day ago

DevOps Engineer, Mid Level

EG flagEgypt OnlyFull-timeDevOps & Site Reliability Engineer (SRE)
ApplyView job
Lakeside Software1 day ago

DevOps Engineer, Azure

IN flagIndia OnlyFull-timeDevOps & Site Reliability Engineer (SRE)
ApplyView job
Interval Group1 day ago

DevOps Engineer, mk8s

DE flagGermany OnlyFreelanceDevOps & Site Reliability Engineer (SRE)
ApplyView job

Never miss a great job!

Get handpicked remote jobs straight to your inbox weekly.

Trusted by 7,400+ designers