Remotery

Site Reliability Engineer

Posted May 31

This is a fully remote position, open to applicants in Serbia.

📋 Description

• Oversee and deploy containerized applications across various environments, including GKE, EKS, and on-premises Kubernetes.

• Release updates to production environments, ensuring smooth rollouts.

• Manage and operate intricate infrastructure, guaranteeing high availability and peak performance through proactive system administration.

• Prepare and synchronize staging environments to replicate production conditions, including configuration adjustments and support for QA validation.

• Enhance and maintain monitoring and alerting systems (Prometheus, Grafana, Alertmanager) to boost system visibility.

• Engage in on-call rotations and conduct root-cause analysis (RCA) to address production issues and avert future occurrences.

• Develop and implement strategies for system troubleshooting, maintenance, and automated recovery.

• Manage and implement InfoSec configurations, covering firewall management, backup strategies, and security enhancements.

• Provide advanced technical support and resolve complex escalations.

• Work alongside Project Managers and cross-functional teams to ensure infrastructure capabilities align with project needs and timelines.

• Investigate and integrate cutting-edge technologies to continuously enhance system reliability, security, and automation.


⛳️ Requirements

• A minimum of 3 years of demonstrated experience as an SRE, IT Operations Engineer, or Systems Administrator.

• Exceptional verbal and written communication skills in English.

• Proficiency in Linux system administration and a solid understanding of networking principles.

• Practical experience in managing Kubernetes clusters.

• Familiarity with monitoring and alerting tools (Prometheus, Grafana, Alertmanager).

• Strong grasp of InfoSec principles and best practices.

• A proactive automation mindset with scripting proficiency in Python, Bash, or similar languages.

• Ability to manage multiple priorities and perform well under pressure.

• A pragmatic, responsible, and collaborative approach to resolving issues.


🏝️ Benefits

• Fully remote work with a flexible schedule.

• Paid vacation and sick leave.

• Funding for relevant courses and online education.

• A positive company culture founded on honesty and mutual respect.

• Opportunities for live team events.
