
Site Reliability Engineer
Posted May 31

Posted May 31
This is a fully remote position, open to applicants in Serbia.
• Oversee and deploy containerized applications across various environments, including GKE, EKS, and on-premises Kubernetes.
• Release updates to production environments, ensuring smooth rollouts.
• Manage and operate intricate infrastructure, guaranteeing high availability and peak performance through proactive system administration.
• Prepare and synchronize staging environments to replicate production conditions, including configuration adjustments and quality control validation support.
• Enhance and maintain monitoring and alerting systems (Prometheus, Grafana, Alertmanager) to boost system visibility.
• Engage in on-call rotations and conduct root-cause analysis (RCA) to address production issues and avert future occurrences.
• Develop and implement strategies for system troubleshooting, maintenance, and automated recovery.
• Manage and implement InfoSec configurations, covering firewall management, backup strategies, and security enhancements.
• Offer advanced technical support, resolving intricate escalations.
• Work alongside Project Managers and cross-functional teams to ensure infrastructure capabilities align with project needs and timelines.
• Investigate and integrate cutting-edge technologies to continuously enhance system reliability, security, and automation.
• A minimum of 3 years of demonstrated experience as an SRE, IT Operations Engineer, or Systems Administrator.
• Exceptional verbal and written communication skills in English.
• Proficient in Linux system administration with a robust understanding of networking principles.
• Practical experience in managing Kubernetes clusters.
• Familiarity with monitoring and alerting tools (Prometheus, Grafana, Alertmanager).
• Strong grasp of InfoSec principles and best practices.
• A proactive automation mindset with scripting proficiency in Python, Bash, or similar languages.
• Capable of juggling multiple priorities and thriving under pressure.
• A pragmatic, responsible, and collaborative approach to resolving issues.
• Fully remote work with a flexible schedule.
• Paid vacation and sick leave.
• Funding for relevant courses and online education.
• A positive company culture founded on honesty and mutual respect.
• Opportunities for live team events.
Experian
In All Media
knowmad mood
Work Life Group
Get handpicked remote jobs straight to your inbox weekly.