
Site Reliability Engineer
Posted May 25

This is a fully remote position, open to applicants in Morocco.
• Oversee, sustain, and enhance system availability within a cloud production setting.
• Guarantee the stability and accessibility of cloud production systems.
• Execute monitoring, alerting, and incident management tasks.
• Automate repetitive operational duties and aid in infrastructure enhancements.
• Diagnose intricate issues concerning performance, system reliability, networking, and service integrations.
• Collaborate with development and operations teams to improve system performance and mitigate operational risks.
• Engage in on-call rotations and initiatives aimed at continuous improvement.
• Over 8 years of experience in cloud production support or system operations.
• Strong knowledge of Linux administration.
• Mastery of cloud monitoring and logging tools such as Prometheus, Grafana, Stackdriver, Cloud Logging, Cloud Storage, or their equivalents.
• Proficient in scripting and automation (Python, Bash, or similar languages).
• Familiarity with CI/CD pipelines and DevOps tools.
• Strong grasp of networking fundamentals; VoIP knowledge is considered an asset.
• Experience in diagnosing issues in distributed systems and microservice architectures.
• Innovation
• Continuous Learning
• Professional Growth