
DevOps Engineer
Posted Jun 3

Posted Jun 3
This is a fully remote position, open to applicants in Germany.
• Take charge of and manage our Kubernetes clusters (AWS EKS for production, bare-metal K3s for machine learning training, and on-premises appliances operating K3s) utilizing GitOps (FluxCD) and Infrastructure as Code (CloudFormation, Ansible).
• Oversee the lifecycle of on-premises gateway appliances implemented at customer locations — including VM image creation, TLS certificate automation, staged rollouts via GitOps, as well as remote monitoring and troubleshooting.
• Establish and sustain monitoring, alerting, and observability infrastructure (Prometheus, Grafana, Loki, Tempo, OpenTelemetry) across both cloud and edge environments.
• Propel compliance automation initiatives — focusing on security hardening, access controls, audit logging, encrypted secrets management (SOPS/KMS), and evidence collection for C5, HIPAA, and HDS certifications.
• Assist in scaling ML training and data processing infrastructure — managing GPU clusters, orchestrating training jobs, and ensuring data pipeline reliability, while collaborating closely with the ML team to enhance advanced speech and language models.
• Handle incident response — encompassing detection, triage, resolution, and conducting post-incident reviews for infrastructure challenges.
• Extensive experience in operating Kubernetes in production environments (cluster management, troubleshooting, networking, storage).
• Familiarity with GitOps workflows.
• Proficient in working with monitoring and observability tools.
• Security-focused — possessing a strong understanding of encryption, access controls, and secrets management.
• Robust Linux systems administration capabilities, along with experience in designing and maintaining CI/CD pipelines.
• Willingness to participate in on-call shift rotations.
• 30 vacation days in addition to your birthday off.
• Germany Transport Ticket.
• Urban Sports Club membership.
• Regular company off-site events.
• Access to learning platforms such as Blinkist and Audible.
• Complimentary language courses.
• Flexible working hours.
Advanced Solutions International, Inc.
Stone
Replit
Soum
Get handpicked remote jobs straight to your inbox weekly.