Remotery

DevOps Engineer

Posted Jun 3

This is a fully remote position, open to applicants in Germany.

📋 Description

• Own and operate our Kubernetes clusters (AWS EKS for production, bare-metal K3s for machine learning training, and on-premises appliances running K3s) using GitOps (FluxCD) and Infrastructure as Code (CloudFormation, Ansible).

• Oversee the lifecycle of on-premises gateway appliances deployed at customer sites, including VM image creation, TLS certificate automation, staged rollouts via GitOps, and remote monitoring and troubleshooting.

• Build and maintain monitoring, alerting, and observability infrastructure (Prometheus, Grafana, Loki, Tempo, OpenTelemetry) across both cloud and edge environments.

• Drive compliance automation: security hardening, access controls, audit logging, encrypted secrets management (SOPS/KMS), and evidence collection for C5, HIPAA, and HDS certifications.

• Help scale ML training and data processing infrastructure (GPU clusters, training job orchestration, data pipeline reliability), working closely with the ML team to improve advanced speech and language models.

• Handle incident response for infrastructure issues: detection, triage, resolution, and post-incident reviews.


⛳️ Requirements

• Extensive experience operating Kubernetes in production (cluster management, troubleshooting, networking, storage).

• Familiarity with GitOps workflows.

• Proficiency with monitoring and observability tools.

• Security-focused mindset, with a strong understanding of encryption, access controls, and secrets management.

• Strong Linux systems administration skills, along with experience designing and maintaining CI/CD pipelines.

• Willingness to participate in on-call rotations.


🏝️ Benefits

• 30 vacation days, plus your birthday off.

• Germany Transport Ticket.

• Urban Sports Club membership.

• Regular company off-site events.

• Access to learning platforms such as Blinkist and Audible.

• Complimentary language courses.

• Flexible working hours.
