
Senior Site Reliability Engineer
Posted May 25

Posted May 25
This is a fully remote position, open to applicants in India.
• Conducting investigations and resolving networking issues within a Linux-based networking environment.
• Overseeing the operation and performance of the networking infrastructure using Prometheus metrics and Grafana dashboards.
• Addressing complex challenges promptly and accurately while preventing future occurrences through proactive troubleshooting, automation, and systems programming.
• Developing software tools and systems to automate analytical tasks and workflows, enhancing efficiency and reliability.
• Utilizing skills in data analysis, network diagnostics, and debugging tools to assess performance and suggest improvements.
• Possess 5+ years of experience in a Site Reliability or System Engineering role, along with a bachelor's degree in computer science or a related discipline.
• Demonstrate expertise in L7 traffic management (Envoy, HAProxy, NGINX) within large-scale distributed systems.
• Be skilled in coding with Python, Perl, R, Java, or SQL, and have knowledge of networking concepts such as routing, firewalls, and DNS.
• Have experience with Linux systems and tools like netstat, traceroute, and tcpdump.
• Be adept in configuration management and container technologies, including Ansible, Salt Stack, Chef, Puppet, Terraform, Docker, Podman, Kubernetes, and Nomad.
• We prioritize your health, well-being, financial stability, and life beyond work. Explore our benefits.
Advanced Solutions International, Inc.
Stone
Replit
Soum
Get handpicked remote jobs straight to your inbox weekly.