Remotery

Senior DevOps Engineer – Cloud, ML Infrastructure

Posted May 21

This is a fully remote position, open to applicants in Greece.

📋 Description

• Design, manage, and enhance Kpler’s cloud-native infrastructure (Kubernetes, networking, compute, storage).

• Contribute to Infrastructure as Code, CI/CD pipelines, and automation of the platform.

• Ensure the high availability, reliability, and security of production systems.

• Enhance observability, monitoring, alerting, and incident response processes.

• Minimize MTTR and failure rates through systematic reliability enhancements.

• Optimize the cost and performance of infrastructure, particularly for compute-intensive workloads.

• Support and assist in standardizing ML/GPU-based workloads within the current platform framework.

• Work closely with ML engineers, data engineers, and backend teams to facilitate production-grade deployments.

• Influence architectural decisions that guide the platform's evolution.


⛳️ Requirements

• Over 5 years of experience in cloud/platform engineering within production settings.

• Extensive hands-on experience with Kubernetes in a production environment.

• Familiarity with Infrastructure as Code (preferably Terraform).

• Strong knowledge of AWS or an equivalent cloud service provider.

• Experience in managing distributed systems in 24/7 operational environments.

• Robust operational mindset (SLOs, monitoring, incident management).

• Bachelor’s or Master’s degree in Computer Science, Engineering, or comparable practical experience.

• Proficient programming skills (Python or Go preferred).

• Comprehensive understanding of cloud-native architecture and principles of reliability engineering.


🏝️ Benefits

• Competitive salary and performance-based bonuses.

• Flexible work hours and remote work opportunities.

• Comprehensive health and wellness benefits.

• Professional development and training programs.

• Collaborative and innovative team environment.

People also viewed

Advanced Solutions International, Inc.10 hours ago

DevOps Reliability Engineer

AU flagAustralia OnlyFull-timeDevOps & Site Reliability Engineer (SRE)$90k – $110k/year
ApplyView job
Stone10 hours ago

Senior Site Reliability Engineer – Network

BR flagBrazil OnlyFull-timeDevOps & Site Reliability Engineer (SRE)
ApplyView job
Replit1 day ago

Staff Site Reliability Engineer

EuropeFull-timeDevOps & Site Reliability Engineer (SRE)
ApplyView job
Soum1 day ago

DevOps Engineer, Mid Level

EG flagEgypt OnlyFull-timeDevOps & Site Reliability Engineer (SRE)
ApplyView job
Lakeside Software1 day ago

DevOps Engineer, Azure

IN flagIndia OnlyFull-timeDevOps & Site Reliability Engineer (SRE)
ApplyView job
Interval Group1 day ago

DevOps Engineer, mk8s

DE flagGermany OnlyFreelanceDevOps & Site Reliability Engineer (SRE)
ApplyView job

Never miss a great job!

Get handpicked remote jobs straight to your inbox weekly.

Trusted by 7,400+ designers