
Observability Consultant
Posted May 20

Posted May 20
This is a fully remote position, open to applicants in Brazil.
• Participate in the implementation, administration, maintenance, and advancement of the observability platform.
• Set up and manage observability tools, oversee agents and collectors, and enforce retention policies along with performance optimization.
• Create dashboards, alerts, and workflows, and engage in incident troubleshooting.
• Extensive experience with Datadog or Elasticsearch, encompassing implementation, administration, maintenance, and platform evolution.
• Proficient in configuring and operating the tool, including management of agents and collectors, retention policies, performance optimization, licensing, and platform governance.
• Familiarity with application instrumentation.
• Practical understanding of OpenTelemetry, distributed telemetry, and contemporary observability practices.
• Capability to analyze and correlate metrics, logs, and traces.
• Background in advanced troubleshooting, incident investigation, profiling, tracing, and root cause analysis.
• Experience in developing dashboards, alerts, queries, notebooks, and workflows within the tool.
• Knowledge of integrations through APIs, webhooks, and native connectors, including scenarios involving ITSM/CMDB and monitoring tools.
• Experience with cloud environments and distributed applications.
• Familiarity with Kubernetes/EKS and monitoring/instrumentation of containerized workloads.
• Understanding of agile methodologies such as Scrum and Kanban.
• Preferred: advanced expertise in Datadog or Elasticsearch.
• Experience in high-data-volume settings with multiple services and distributed architectures.
• Background in 24/7 operations and situations that require high availability and resilience.
• Experience supporting mission-critical applications, ideally in sectors with significant operational demands such as retail, finance, logistics, or e-commerce.
• Knowledge of observability as it pertains to microservices, APIs, messaging, and hybrid/cloud environments.
• Experience in integrating observability with incident management and problem management processes.
• Experience with observability automation as code.
• Competitive salary and performance-based bonuses.
• Comprehensive health, dental, and vision insurance.
• Opportunities for professional development and career advancement.
• Flexible work hours and remote working options.
• A collaborative and innovative work environment.
Kainos
TecnoSpeed TI
ValueNet Group
Get handpicked remote jobs straight to your inbox weekly.