
DevOps – Infrastructure Engineer
Posted May 2

Posted May 2
• Design, construct, and maintain our observability platform, including metrics, logs, traces, and everything in between.
• Engage directly with infrastructure: deploy services, resolve incidents, and rectify issues when they arise (which they inevitably will).
• Instrument applications and services to gather significant telemetry data that provides actionable insights.
• Create dashboards and alerting systems that are genuinely useful to teams—not just sources of noise.
• Investigate production issues, correlate data across different systems, and spearhead root cause analysis.
• Advocate for observability best practices among engineering teams and assist developers in instrumenting their own code.
• Automate as much as possible: infrastructure provisioning, deployment pipelines, and operational runbooks.
• Collaborate closely with SRE and development teams to enhance system reliability and performance.
• Assess and incorporate new observability tools and technologies as the landscape evolves.
• A minimum of 3 years of experience in DevOps, Infrastructure, or SRE roles—with tangible production experience.
• Extensive hands-on experience with observability tools such as Prometheus, Grafana, Datadog, New Relic, Splunk, ELK stack, Jaeger, or similar.
• Strong expertise in cloud platforms (AWS, GCP, or Azure) and infrastructure-as-code (Terraform, Pulumi, CloudFormation).
• Proficient scripting and automation skills (Python, Bash, Go, or similar).
• Experience with containerization and orchestration technologies (Docker, Kubernetes).
• A solid understanding of distributed systems, microservices architectures, and their specific observability challenges.
• Familiarity with CI/CD pipelines and GitOps workflows.
• Exceptional troubleshooting abilities—you are the individual who perseveres until the root cause is identified.
• Competitive salary and equity package.
• Flexible working arrangements.
• Learning and development budget.
• Modern tech stack and the freedom to make a meaningful impact.
• A team that prioritizes quality execution over speed.
Arctiq
Arctiq
Software Mind
Mediastream
Get handpicked remote jobs straight to your inbox weekly.