
Senior Site Reliability Engineer
Posted May 25

Posted May 25
This is a fully remote position, open to applicants in Poland.
• Collaborate with teams to enhance service delivery and reliability throughout their lifecycle.
• Assess and oversee all production systems with a focus on availability, latency, and overall system performance.
• Investigate the root causes of errors and instability in our production cloud services, guiding teams towards improved operational excellence.
• Partner with product and platform teams to enhance and evolve systems by advocating for changes that boost reliability, resilience, and observability.
• Assist in identifying and reducing toil through innovative solutions and automation.
• This role will involve standby, on-call, or off-hours responsibilities.
• Demonstrated experience in designing, implementing, and managing observability systems for intricate cloud-based platforms.
• Familiarity with Configuration Management and Infrastructure as Code tools such as Terraform (preferred) or Ansible.
• Understanding of cloud platforms, preferably AWS and Azure.
• Proficient in APM and observability tools, including New Relic, Splunk, CloudWatch, Prometheus, Grafana/Kibana, Sentry, etc.
• Extensive experience in enterprise-scale continuous delivery environments.
• Development experience with JavaScript/Node.js/TypeScript in a Linux/Mac environment.
• Experience in sustainable incident response within a blameless culture.
• Background in Linux Systems Engineering.
• Proficient with incident response tools such as PagerDuty, FireHydrant, Blameless, etc.
• Comfortable working autonomously and collaborating with a distributed team.
• Knowledgeable about cloud and application security best practices.
• Strong understanding of cloud design patterns for scalability, data management, resiliency, and more.
• Passion for high-quality outcomes and a talent for testing.
• Possesses views on business metrics and service level objectives (SLOs).
• Diversity fosters innovation and leads to better decision-making.
• Emphasis on a remote-first culture.
• An inclusive environment that welcomes and values differences.
Advanced Solutions International, Inc.
Stone
Replit
Soum
Get handpicked remote jobs straight to your inbox weekly.