
Observability and Monitoring Consultant
Posted 19 hours ago

• Governance and Observability Model: Define, maintain, and evolve best practices, standards, and procedures for corporate observability.
• KPIs, SLIs, and SLOs: Design metric models focused on reliability, business outcomes, and user experience, avoiding approaches that rely solely on technical thresholds.
• End-to-End Observability: Ensure proper instrumentation of metrics, logs, and distributed traces across various architecture components.
• Reporting and Dashboards: Design and maintain corporate reporting models and dashboards that provide a clear view of service status.
• Alerting and Noise Management: Define and optimize advanced alerting models integrated with ITSM processes, enhancing early detection and reducing unnecessary alerts.
• Functional Monitoring: Oversee the quality of instrumentation and the consistency of observed data without assuming daily operational responsibilities of platforms.
• Continuous Improvement: Participate in post-mortem reviews, root cause analysis, and continuous improvement processes for service reliability and quality.
• Multidisciplinary Collaboration: Continuously coordinate with architecture, development, operations, and technology vendor teams.
• Functional Reference: Serve as a functional reference point regarding observability for various technical teams.
• Between 3 and 6 years of experience in complex IT environments.
• Previous experience in governance, design, or evolution of observability, monitoring, APM, or SRE services.
• Knowledge and experience in end-to-end observability: metrics, logs, and distributed traces.
• Functional knowledge of observability/APM platforms such as Dynatrace, Datadog, New Relic, or AppDynamics, including the definition and management of SLOs, SLIs, and SLAs.
• General understanding of application architectures, middleware, infrastructure, and cloud environments.
• Experience in advanced alerting models, noise reduction, and integrating observability with IT incident, problem, and change processes.
• Familiarity with ITSM and ITIL methodologies, particularly in Incident Management, Problem Management, and Continuous Improvement.
• Ability to create clear and structured technical and procedural documentation.
• Previous experience in regulated corporate environments such as banking, insurance, or utilities is a plus.
• Teamwork skills and effective communication with technical teams, business stakeholders, and vendors.
• A team of over 3,000 creative, digital, and innovative people connected by a purpose and capable of forming connections with people worldwide.
• A responsible team that is flexible and highly adaptable to the needs of our clients and the market.
• Commitment to equal opportunities and respect for diversity.