This is a fully remote position, open to applicants in Bulgaria.

📋 Description

• Design, implement, and enhance reliable, scalable, high-performance, and secure production platforms and services.

• Collaborate closely with cross-functional teams to construct and maintain resilient infrastructure and deployment patterns.

• Provide support and guidance to engineers throughout the organization, fostering strong engineering standards and operational best practices.

• Engage in a 24x7 on-call rotation to support critical services and ensure platform availability.

• Promote standardization, automation, and documentation to enhance consistency, minimize operational overhead, and facilitate knowledge sharing.

• Contribute to all phases of platform and service delivery, from design and construction to operation and optimization.

⛳️ Requirements

• A minimum of 3 years of experience in DevOps, Site Reliability Engineering (SRE), platform engineering, or software engineering roles.

• Extensive experience with Kubernetes, coupled with a solid understanding of containers and container orchestration.

• Practical experience with infrastructure as code tools such as Terraform, Ansible, or Puppet.

• Proficiency in at least one object-oriented programming language, alongside scripting and automation capabilities.

• Strong knowledge of security principles and best practices across infrastructure, platforms, and services.

• Practical experience with one or more major cloud platforms, such as AWS, GCP, or OCI.

• Experience in monitoring, alerting, and observability using tools such as Prometheus, Grafana, or other similar platforms.

• A solid understanding of networking fundamentals and distributed systems.

• Strong experience in Linux and/or Windows systems administration.

• Familiarity with software delivery automation, CI/CD pipelines, and secure Software Development Life Cycle (SDLC) practices, including exposure to static and dynamic security testing.

• A good grasp of SRE concepts such as Service Level Indicators (SLIs), Service Level Objectives (SLOs), Service Level Agreements (SLAs), toil reduction, availability, and observability.

• Experience in managing and supporting Elasticsearch in production is advantageous.

🏝️ Benefits

• Additional Premium Health insurance

• Life Insurance

• Food vouchers worth 70 euros per month

• Holiday entitlement: 25 vacation days and 4 Wellness days per year

• Flexible benefits (Re:benefits)

• Full Calm subscription

• 24/7 Employee Assistance Program

DevOps Engineer – Observability
