
Operations Engineer
Posted May 19

Posted May 19
This is a fully remote position, open to applicants in Azerbaijan.
• Act as the main dashboard overseer throughout your shift
• Assess and investigate production incidents
• Manage lower-severity incidents comprehensively from detection to resolution
• Assist the TSO Lead during significant incidents
• Prepare incident communications under the guidance of the TSO Lead
• Examine incident patterns, recurring problems, and production bugs
• Assemble incident timelines and create initial PIR documents
• Develop and uphold operational automation
• Execute organized shift handoffs
• Provide coverage for the TSO Lead during their time off
• Over 4 years of experience in SRE, DevOps, production operations, NOC, or technical operations within a high-availability setting
• Excellent troubleshooting and investigative abilities
• Practical experience with Datadog (or a comparable observability tool)
• Proficient in at least one scripting language: Python, Go, or Bash
• Strong written and verbal communication skills in English
• Familiarity with Kubernetes and cloud infrastructure
• Knowledge of SLOs, error budgets, and burn-rate alerting
• Experience with incident management tools: JIRA or JIRA Service Management
• Background in or keen interest in AI/ML-assisted operations
• Willingness to work in a 24x7 shift-based environment
• Health insurance
• Remote work options
Remote
Get handpicked remote jobs straight to your inbox weekly.