Remotery

Senior Site Reliability Engineer (SRE)

Posted May 30

This is a fully remote position, open to applicants in Armenia.

📋 Description

• Lead the design, implementation, and continuous improvement of our Kubernetes-based platform across multiple cloud environments (GCP/AWS).

• Architect and manage our Kubernetes ecosystem, prioritizing high availability and zero-downtime operations.

• Take ownership of our PaaS strategy, enabling domain teams to deploy autonomously.

• Establish and execute our observability strategy encompassing metrics, logs, and tracing.

• Spearhead the automation of our infrastructure utilizing Terraform.

• Collaborate with engineering teams to oversee SLOs, SLAs, and frameworks for incident management.

• Design and engage in regular disaster recovery simulations.

• Implement AI-driven strategies to enhance operational efficiency.


⛳️ Requirements

• Extensive hands-on experience in managing Kubernetes (preferably GKE) within high-load, multi-cluster production settings.

• Deep expertise in GCP (AWS knowledge is a significant advantage) and in Terraform for managing large-scale infrastructure.

• Strong familiarity with ArgoCD, GitLab CI, and the principles of 'Infrastructure as Code'.

• In-depth understanding of the Prometheus/Grafana stack and the implementation of tracing/logging at scale.

• Demonstrated ability to design systems that maintain high availability 24/7, featuring automated failover and rollback processes.

• English proficiency at B2 level or higher for effective cross-functional communication.


🏝️ Benefits

• Make a meaningful impact on the product.

• Join our growth journey with access to resources and opportunities for both personal and professional development.

• Work within the EU with the flexibility to travel and operate remotely or in a hybrid model throughout Europe.

• Become a stock options holder through our Stock Options Program available to all team members.

• Enjoy unwavering support and care that reflects our dedication to your well-being and success.

• Engage in our exclusive Work & Swim Program in Cyprus for a balanced work-life experience.

• We are an Equal Opportunity Employer that values diversity.

