
Senior Observability Analyst – SRE, Monitoring
Posted May 20

Posted May 20
This is a fully remote position, open to applicants in Brazil.
• Serve as the technical observability leader for high-criticality environments.
• Oversee and enhance solutions such as Datadog, Zabbix, and Grafana.
• Implement and refine APM practices, UX monitoring, traces, metrics, and logs.
• Utilize Azure Monitor and Azure Logs for troubleshooting and event correlation.
• Design and establish alert integrations through PagerDuty.
• Develop and maintain playbooks and runbooks for incident management.
• Assist in root cause analysis and outline preventive measures alongside infrastructure and application teams.
• Demonstrated experience in Observability, Monitoring, or SRE.
• Advanced expertise with Datadog (APM, UX, traces, dashboards, and alerts).
• Familiarity with Zabbix (infrastructure) and Grafana (integrated dashboards).
• Understanding of Azure Monitor and Azure Logs.
• Experience with ITIL processes (Incident, Problem, and Change Management).
• Proven ability to produce technical documentation (playbooks/runbooks).
• Knowledge of Azure cloud architecture.
• Experience with ITSM tools.
• Practical understanding of SRE methodologies.
• Familiarity with continuous improvement processes (PDCA).
• Wellhub (Gympass) – Basic plan.
• Life insurance.
• Close support from the staff team and technical mentorship.
• Collaborative environment focused on continuous improvement.
Advanced Solutions International, Inc.
Stone
Replit
Soum
Get handpicked remote jobs straight to your inbox weekly.