Remotery

Senior Site Reliability Engineer, SRE

Posted 6 days ago

This is a fully remote position, open to applicants in Cyprus.

📋 Description

• Design and manage our Kubernetes ecosystem (GKE, multi-cluster) with an emphasis on high availability and seamless operations without downtime.

• Take ownership of our PaaS strategy, utilizing GitOps (ArgoCD) and CI/CD (GitLab) to enable domain teams to deploy autonomously.

• Establish and execute our observability strategy encompassing metrics, logs, and tracing (Prometheus, VictoriaMetrics, OpenTelemetry).

• Spearhead the automation of our infrastructure using Terraform, ensuring that all resources are standardized and version-controlled.

• Collaborate with engineering teams to set up and oversee SLOs, SLAs, and frameworks for incident management.

• Design and partake in routine disaster recovery drills, implementing blue/green and active/passive strategies across regions to guarantee service continuity.

• Proactively utilize AI-driven methodologies to enhance operational efficiency and automate bottleneck detection.


⛳️ Requirements

• Extensive hands-on experience in managing Kubernetes (preferably GKE) within high-load, multi-cluster production settings.

• Profound expertise with GCP (with AWS as a significant advantage) and Terraform for large-scale infrastructure deployment.

• Strong background in ArgoCD, GitLab CI, and the principles of 'Infrastructure as Code'.

• In-depth knowledge of the Prometheus/Grafana stack and experience in implementing tracing/logging at scale.

• Demonstrated ability to design highly available systems that operate 24/7 with automated failover and rollback functionalities.

• Proficient in English at a B2+ level for effective communication across functional teams.


🏝️ Benefits

• Make a genuine impact on the product.

• Join our upward trajectory and grow with us. We provide the resources and opportunities for continuous personal and professional development, empowering you to make a genuine impact on our evolving product.

• Work in the EU.

• Enjoy the flexibility of traveling and working remotely or in a hybrid model across Europe.

• Become a stock options holder.

• Unlock your inner entrepreneur and align your aspirations with ours through our Stock Options Program.

• Receive unwavering support and care.

• Finom stands by you at every step, embodying our commitment to your well-being and success, reflected in our modern, friendly, and eco-conscious corporate culture.

• Work & Swim program.

• Immerse yourself in our exclusive Work & Swim Program. Spend one month in a comfortable corporate apartment in enchanting Cyprus. It's the ideal opportunity to strike the perfect work-life balance while enjoying breathtaking Mediterranean views.

• Equal Opportunity Statement.

• At Finom, we're an equal opportunity employer and value diversity at our company. We embrace diversity and invite applications from all walks of life.

People also viewed

Work Life Group35 min ago

Lead DevOps Engineer, Data & AI Platform

HU flagHungary OnlyFull-timeDevOps & Site Reliability Engineer (SRE)
ApplyView job
accesa.eu35 min ago

DevOps Engineer, German

RO flagRomania OnlyFull-timeDevOps & Site Reliability Engineer (SRE)
ApplyView job
Cisco41 min ago

Site Reliability Engineer – Kubernetes Platform

IN flagIndia OnlyFull-timeDevOps & Site Reliability Engineer (SRE)
ApplyView job
Work Life Group48 min ago

Lead DevOps Engineer – Data & AI Platform

CZ flagCzechia OnlyFull-timeDevOps & Site Reliability Engineer (SRE)
ApplyView job
JumpCloud48 min ago

Security Engineer, DevSecOps

MX flagMexico OnlyFull-timeDevOps & Site Reliability Engineer (SRE)
ApplyView job
Unit448 min ago

Cloud Operations Engineer

PT flagPortugal OnlyFull-timeDevOps & Site Reliability Engineer (SRE)€30.5k – €35.1k/year
ApplyView job

Never miss a great job!

Get handpicked remote jobs straight to your inbox weekly.

Trusted by 7,400+ designers