
DevOps Engineer, Splunk
Posted 3 days ago

Posted 3 days ago
This is a fully remote position, open to applicants in India.
• Design and execute comprehensive observability solutions across applications, infrastructure, and cloud environments.
• Create dashboards, alerts, and telemetry frameworks to deliver real-time insights into system health and performance.
• Develop automation solutions to remove repetitive operational tasks and enhance efficiency.
• Facilitate runbook automation, self-healing mechanisms, and automated incident triage workflows.
• Establish and implement SLIs, SLOs, and alerting strategies to enhance service reliability.
• Propel improvements in MTTD and MTTR through actionable alerts and insights driven by telemetry.
• Implement proactive monitoring, anomaly detection, and predictive alerting to recognize issues prior to customer impact.
• Utilize AIOps capabilities for alert correlation and intelligent incident response.
• Integrate observability platforms with CI/CD pipelines, cloud services, and ITSM tools like ServiceNow.
• Partner with engineering, product, and operations teams to set observability standards and operational readiness practices.
• A minimum of 3 years of experience in Observability Engineering, Site Reliability Engineering, or related fields.
• Practical experience with observability platforms such as Splunk, Dynatrace, Grafana, and OpenTelemetry.
• Strong knowledge of AWS and GCP, along with familiarity with cloud-native architectures.
• Proficient in Python for automation and operational tooling.
• Experience in implementing metrics, logs, events, and distributed tracing (MELT) across distributed systems.
• Hands-on experience with Terraform and Infrastructure as Code methodologies.
• Solid understanding of SLIs, SLOs, alerting strategies, and incident response frameworks.
• Outstanding troubleshooting, communication, and collaboration abilities.
• Bachelor's degree in Computer Science, Information Technology, or a related area (or equivalent experience).
• Culture of Relentless Performance: join an unstoppable technology development team with a 99% project success rate and over 30% year-over-year revenue growth.
• Competitive Pay and Benefits: enjoy a comprehensive compensation and benefits package, including health insurance, language courses, and a relocation program.
• Work From Anywhere Culture: take advantage of the flexibility offered by remote work.
• Growth Mindset: benefit from a variety of professional development opportunities, including certification programs, mentorship and talent investment initiatives, internal mobility, and internship options.
• Global Impact: collaborate on significant projects for top global clients and help shape the future of industries.
• Welcoming Multicultural Environment: be part of a dynamic, global team and excel in an inclusive and supportive work atmosphere with open communication and regular team-building social events.
• Social Sustainability Values: engage in our sustainable business practices focused on five pillars, including IT education, community empowerment, fair operating practices, environmental sustainability, and gender equality.
Advanced Solutions International, Inc.
Stone
Replit
Soum
Get handpicked remote jobs straight to your inbox weekly.