Remotery

Senior Site Reliability Engineer, Observability

Posted May 20

This is a fully remote position, open to applicants in Argentina.

📋 Description

• Enhance the reliability and stability of Webflow’s production infrastructure that is customer-facing.

• Ensure the platform's security and scalability as user projects are initiated.

• Assist in defining and implementing observability practices, empowering engineers to confidently deploy and manage services in production.

• Develop and maintain AI-driven agents and automation to help engineers quickly obtain insights, minimize alert fatigue, and expedite incident resolution.

• Engage in and refine on-call and incident response procedures.


⛳️ Requirements

• A BS/BA degree or equivalent experience.

• Proficient in English, capable of reading, writing, and speaking at a business level.

• Over 5 years of experience in building, maintaining, and troubleshooting distributed systems in a customer-facing environment with minimal downtime.

• Practical experience with observability platforms and tools such as Datadog, Grafana, Prometheus, ElasticSearch, or similar.

• Familiarity with OpenTelemetry or comparable instrumentation frameworks for gathering metrics, traces, profiles, and logs across distributed services.

• Proven experience in defining and operationalizing SLOs/SLIs at scale.

• Skilled in navigating and scaling multi-tier cloud environments on AWS or GCP.

• Knowledgeable in container-centric architectures developed with tools like Docker and Kubernetes (EKS, GKE, AKS, etc.), or ECS.

• Proficient with infrastructure-as-code tools such as Terraform or Pulumi.

• Experience contributing to full-stack applications developed using technologies like React, Node.js, and MongoDB or PostgreSQL.


🏝️ Benefits

• Ownership in the projects you contribute to.

• Comprehensive health coverage that meets your needs.

• Support throughout all stages of family life.

• Time off that truly allows you to disconnect.

• Wellness programs that support your overall well-being.

• Investment in your future growth.

• Monthly stipends that adapt to your lifestyle.

• Bonuses for collaborative achievements.

People also viewed

Advanced Solutions International, Inc.10 hours ago

DevOps Reliability Engineer

AU flagAustralia OnlyFull-timeDevOps & Site Reliability Engineer (SRE)$90k – $110k/year
ApplyView job
Stone10 hours ago

Senior Site Reliability Engineer – Network

BR flagBrazil OnlyFull-timeDevOps & Site Reliability Engineer (SRE)
ApplyView job
Replit1 day ago

Staff Site Reliability Engineer

EuropeFull-timeDevOps & Site Reliability Engineer (SRE)
ApplyView job
Soum1 day ago

DevOps Engineer, Mid Level

EG flagEgypt OnlyFull-timeDevOps & Site Reliability Engineer (SRE)
ApplyView job
Lakeside Software1 day ago

DevOps Engineer, Azure

IN flagIndia OnlyFull-timeDevOps & Site Reliability Engineer (SRE)
ApplyView job
Interval Group1 day ago

DevOps Engineer, mk8s

DE flagGermany OnlyFreelanceDevOps & Site Reliability Engineer (SRE)
ApplyView job

Never miss a great job!

Get handpicked remote jobs straight to your inbox weekly.

Trusted by 7,400+ designers