
DevOps Engineer – Observability
Posted May 24

Posted May 24
This is a fully remote position, open to applicants in Bulgaria.
• Design, implement, and enhance reliable, scalable, high-performance, and secure production platforms and services.
• Collaborate closely with cross-functional teams to construct and maintain resilient infrastructure and deployment patterns.
• Provide support and guidance to engineers throughout the organization, fostering strong engineering standards and operational best practices.
• Engage in a 24x7 on-call rotation to support critical services and ensure platform availability.
• Promote standardization, automation, and documentation to enhance consistency, minimize operational overhead, and facilitate knowledge sharing.
• Contribute to all phases of platform and service delivery, from design and construction to operation and optimization.
• A minimum of 3 years of experience in DevOps, Site Reliability Engineering (SRE), platform engineering, or software engineering roles.
• Extensive experience with Kubernetes, coupled with a solid understanding of containers and container orchestration.
• Practical experience with infrastructure as code tools such as Terraform, Ansible, or Puppet.
• Proficiency in at least one object-oriented programming language, alongside scripting and automation capabilities.
• Strong knowledge of security principles and best practices across infrastructure, platforms, and services.
• Practical experience with one or more major cloud platforms, with familiarity in AWS, GCP, or OCI.
• Experience in monitoring, alerting, and observability using tools such as Prometheus, Grafana, or other similar platforms.
• A solid understanding of networking fundamentals and distributed systems.
• Strong experience in Linux and/or Windows systems administration.
• Familiarity with software delivery automation, CI/CD pipelines, and secure Software Development Life Cycle (SDLC) practices, including exposure to static and dynamic security testing.
• A good grasp of SRE concepts such as Service Level Indicators (SLIs), Service Level Objectives (SLOs), Service Level Agreements (SLAs), toil reduction, availability, and observability.
• Experience in managing and supporting Elasticsearch in production is advantageous.
• Additional Premium Health insurance
• Life Insurance
• Food vouchers amounting to 70 Euro per month
• Holiday entitlement: 25 days vacation and 4 Wellness days per year
• Flexible benefits (Re:benefits)
• Full Calm subscription
• 24/7 Employment Assistance Program
Advanced Solutions International, Inc.
Stone
Replit
Soum
Get handpicked remote jobs straight to your inbox weekly.