Remotery

Senior DevOps Engineer

Posted 6 days ago

This is a fully remote position, open to applicants in Poland.

📋 Description

• Contribute to the automation of infrastructure across the AWS platform utilizing Terraform Enterprise, Ansible, Harness, Kubernetes (including operators, CRDs, and cluster lifecycle management), and GitOps methodologies.

• Support and enhance continuous delivery pipelines to ensure dependable and repeatable deployments across various environments (PRE, SBX, PRD).

• Develop and sustain self-service capabilities that empower developers and engineering teams to autonomously claim and utilize infrastructure resources (storage, compute, databases, messaging) via Kubernetes-native APIs and GitOps workflows, eliminating the need for manual intervention.

• Utilize AIOps strategies to enhance platform reliability, including intelligent alerting, anomaly detection, AI-assisted incident triage, and automated remediation.

• Create, maintain, and enhance production runbooks to guarantee that operational procedures are automated, documented, and accessible to the team.

• Provide L2 support for existing automation systems and pipelines, resolving issues across the platform ecosystem.

• Participate in on-call rotations and assist with incident response and post-mortem reviews.

• Collaborate closely with Production Ops, DBA, and application engineering teams to drive platform enhancements and migrations.

• Contribute to FinOps by identifying and executing AWS cost optimization strategies.

• Document architectural decisions and operational procedures, and take part in team knowledge sharing.

• Take ownership of, automate, and improve Kyriba's storage platform (NetApp, S3, EBS), ensuring reliability, performance, and disaster recovery readiness.

• Design and implement storage architectures on AWS that ensure high availability and fault tolerance.

• Provide L2 support for storage-related incidents, leading investigations on performance degradation, availability challenges, and data integrity events.


⛳️ Requirements

• Bachelor's degree in Computer Science, Engineering, or a related field.

• Over 5 years of hands-on DevOps experience in building and managing a SaaS platform on AWS.

• More than 3 years of experience with Infrastructure as Code (Terraform) in an AWS environment.

• At least 2 years of experience as a platform engineer, contributing to automation and self-service ecosystems.

• Strong knowledge of Kubernetes.

• In-depth understanding of NetApp ONTAP (administration, provisioning, troubleshooting).

• Proficiency in AWS storage services (S3, EBS, EFS, FSx).

• Experience in integrating storage with Kubernetes workloads (PV, PVC, StorageClass, CSI drivers).

• Background in providing L2 support for automation systems and/or storage infrastructure.

• Familiarity with AIOps practices: intelligent alerting, anomaly detection, AI-assisted automation (e.g., Datadog AI, GitHub Copilot, or similar).

• Knowledge of CI/CD practices and GitOps workflows (ArgoCD, Kargo, GitHub Actions).

• Experience with monitoring and observability tools (Datadog, Splunk, or similar).

• Strong scripting or development proficiency in Python, Bash, or Go.


🏝️ Benefits

• 15% yearly bonus and annual salary increase based on individual performance.

• MacBook Pro or equivalent equipment provided.

• Access to AI productivity tools (ChatGPT, Copilot).

• Opportunities for professional development: Coursera, Pluralsight, LinkedIn Learning, and conference attendance (e.g., KubeCon).

• International collaboration with DevOps, SRE, and Engineering teams.

• Medical, sports, and life insurance coverage.

• Participation in the Equity Incentive Plan.
