
Observability Engineer
Posted 6 days ago

Posted 6 days ago
This is a fully remote position, open to applicants in India.
• Create and implement comprehensive observability solutions spanning applications, infrastructure, and cloud environments.
• Design dashboards, alerts, and telemetry frameworks to ensure real-time insights into system health and performance.
• Develop automation solutions to reduce repetitive operational tasks and enhance efficiency.
• Facilitate runbook automation, self-healing capabilities, and automated incident triage workflows.
• Establish and execute SLIs, SLOs, and alerting strategies to enhance service reliability.
• Promote improvements in Mean Time to Detect (MTTD) and Mean Time to Recover (MTTR) through actionable alerts and insights driven by telemetry.
• Implement proactive monitoring, anomaly detection, and predictive alerting to identify potential issues before they affect customers.
• Utilize AIOps capabilities for alert correlation and smart incident response.
• Integrate observability platforms with CI/CD pipelines, cloud services, and IT Service Management (ITSM) tools like ServiceNow.
• Collaborate with engineering, product, and operations teams to set observability standards and ensure operational readiness.
• Guide teams and promote the adoption of observability best practices throughout the organization.
• 4+ years of experience in Observability Engineering, Site Reliability Engineering, or related fields.
• Practical experience with observability platforms such as Dynatrace, Splunk, Grafana, and OpenTelemetry.
• Strong knowledge of AWS and GCP, along with familiarity with cloud-native architectures.
• Proficient in Python for automation and operational tooling.
• Experience in implementing metrics, logs, events, and distributed tracing (MELT) across distributed systems.
• Hands-on experience with Terraform and Infrastructure as Code practices.
• Solid understanding of SLIs, SLOs, alerting strategies, and incident response frameworks.
• Excellent troubleshooting, communication, and collaboration abilities.
• Bachelor's degree in Computer Science, Information Technology, or a related field (or equivalent experience).
• Culture of Relentless Performance: join an unstoppable technology development team with a 99% project success rate and over 30% year-over-year revenue growth.
• Competitive Pay and Benefits: enjoy a comprehensive compensation and benefits package, including health insurance, language courses, and a relocation program.
• Work From Anywhere Culture: take advantage of the flexibility that comes with remote work.
• Growth Mindset: benefit from a range of professional development opportunities, including certification programs, mentorship and talent investment programs, internal mobility, and internship opportunities.
• Global Impact: collaborate on meaningful projects for leading global clients and help shape the future of various industries.
• Welcoming Multicultural Environment: become part of a dynamic, global team and thrive in an inclusive and supportive work environment, featuring open communication and regular team-building social events.
• Social Sustainability Values: engage in sustainable business practices focused on five pillars, including IT education, community empowerment, fair operating practices, environmental sustainability, and gender equality.
TechBiz Global
ALTEN
Seekerh
Get handpicked remote jobs straight to your inbox weekly.