
Junior Observability Engineer
Posted 6 days ago

This is a fully remote position, open to applicants in India.
• Design and execute comprehensive observability solutions across applications, infrastructure, and cloud environments.
• Create dashboards, alerts, and telemetry frameworks to deliver real-time insights into system health and performance.
• Develop automation solutions to eliminate repetitive operational tasks, enhancing overall efficiency.
• Facilitate runbook automation, self-healing capabilities, and automated incident triage workflows.
• Establish and implement SLIs, SLOs, and alerting strategies to enhance service reliability.
• Drive improvements in MTTD and MTTR through actionable, telemetry-driven alerts and insights.
• Implement proactive monitoring, anomaly detection, and predictive alerting to detect issues prior to customer impact.
• Utilize AIOps capabilities for alert correlation and intelligent incident response.
• Integrate observability platforms with CI/CD pipelines, cloud services, and ITSM tools like ServiceNow.
• Work collaboratively with engineering, product, and operations teams to set observability standards and operational readiness practices.
• A minimum of 3 years of experience in Observability Engineering, Site Reliability Engineering, or related fields.
• Practical experience with observability platforms such as Splunk, Dynatrace, Grafana, and OpenTelemetry.
• Strong knowledge of AWS and GCP, along with familiarity with cloud-native architectures.
• Proficiency in Python for automation and operational tooling.
• Experience implementing metrics, events, logs, and traces (MELT) across distributed systems.
• Hands-on experience with Terraform and Infrastructure as Code methodologies.
• Solid understanding of SLIs, SLOs, alerting strategies, and incident response frameworks.
• Exceptional troubleshooting, communication, and collaboration skills.
• Bachelor's degree in Computer Science, Information Technology, or a related field (or equivalent experience).
• Culture of Relentless Performance: join an unstoppable technology development team with a 99% project success rate and over 30% year-over-year revenue growth.
• Competitive Pay and Benefits: enjoy a comprehensive compensation and benefits package, including health insurance, language courses, and a relocation program.
• Work From Anywhere Culture: make the most of the flexibility that comes with remote work.
• Growth Mindset: take advantage of a variety of professional development opportunities, including certification programs, mentorship and talent investment initiatives, as well as internal mobility and internship opportunities.
• Global Impact: collaborate on meaningful projects for top global clients and influence the future of industries.
• Welcoming Multicultural Environment: be part of a dynamic, global team and thrive in an inclusive and supportive work atmosphere with open communication and regular team-building social events.
• Social Sustainability Values: take part in sustainable business practices built on five pillars: IT education, community empowerment, fair operating practices, environmental sustainability, and gender equality.