Remotery

Senior Observability Analyst – SRE, Monitoring

Posted May 20

This is a fully remote position, open to applicants in Brazil.

📋 Description

• Serve as the technical observability leader for high-criticality environments.

• Oversee and enhance solutions such as Datadog, Zabbix, and Grafana.

• Implement and refine APM practices, UX monitoring, traces, metrics, and logs.

• Utilize Azure Monitor and Azure Logs for troubleshooting and event correlation.

• Design and establish alert integrations through PagerDuty.

• Develop and maintain playbooks and runbooks for incident management.

• Assist in root cause analysis and outline preventive measures alongside infrastructure and application teams.


⛳️ Requirements

• Demonstrated experience in Observability, Monitoring, or SRE.

• Advanced expertise with Datadog (APM, UX, traces, dashboards, and alerts).

• Familiarity with Zabbix (infrastructure) and Grafana (integrated dashboards).

• Understanding of Azure Monitor and Azure Logs.

• Experience with ITIL processes (Incident, Problem, and Change Management).

• Proven ability to produce technical documentation (playbooks/runbooks).

• Knowledge of Azure cloud architecture.

• Experience with ITSM tools.

• Practical understanding of SRE methodologies.

• Familiarity with continuous improvement processes (PDCA).


🏝️ Benefits

• Wellhub (Gympass) – Basic plan.

• Life insurance.

• Close support from the staff team and technical mentorship.

• Collaborative environment focused on continuous improvement.

People also viewed

Advanced Solutions International, Inc.10 hours ago

DevOps Reliability Engineer

AU flagAustralia OnlyFull-timeDevOps & Site Reliability Engineer (SRE)$90k – $110k/year
ApplyView job
Stone10 hours ago

Senior Site Reliability Engineer – Network

BR flagBrazil OnlyFull-timeDevOps & Site Reliability Engineer (SRE)
ApplyView job
Replit1 day ago

Staff Site Reliability Engineer

EuropeFull-timeDevOps & Site Reliability Engineer (SRE)
ApplyView job
Soum1 day ago

DevOps Engineer, Mid Level

EG flagEgypt OnlyFull-timeDevOps & Site Reliability Engineer (SRE)
ApplyView job
Lakeside Software1 day ago

DevOps Engineer, Azure

IN flagIndia OnlyFull-timeDevOps & Site Reliability Engineer (SRE)
ApplyView job
Interval Group1 day ago

DevOps Engineer, mk8s

DE flagGermany OnlyFreelanceDevOps & Site Reliability Engineer (SRE)
ApplyView job

Never miss a great job!

Get handpicked remote jobs straight to your inbox weekly.

Trusted by 7,400+ designers