Remotery

DevOps – Infrastructure Engineer

Posted May 2

📋 Description

• Design, construct, and maintain our observability platform, including metrics, logs, traces, and everything in between.

• Engage directly with infrastructure: deploy services, resolve incidents, and rectify issues when they arise (which they inevitably will).

• Instrument applications and services to gather significant telemetry data that provides actionable insights.

• Create dashboards and alerting systems that are genuinely useful to teams—not just sources of noise.

• Investigate production issues, correlate data across different systems, and spearhead root cause analysis.

• Advocate for observability best practices among engineering teams and assist developers in instrumenting their own code.

• Automate as much as possible: infrastructure provisioning, deployment pipelines, and operational runbooks.

• Collaborate closely with SRE and development teams to enhance system reliability and performance.

• Assess and incorporate new observability tools and technologies as the landscape evolves.


⛳️ Requirements

• A minimum of 3 years of experience in DevOps, Infrastructure, or SRE roles—with tangible production experience.

• Extensive hands-on experience with observability tools such as Prometheus, Grafana, Datadog, New Relic, Splunk, ELK stack, Jaeger, or similar.

• Strong expertise in cloud platforms (AWS, GCP, or Azure) and infrastructure-as-code (Terraform, Pulumi, CloudFormation).

• Proficient scripting and automation skills (Python, Bash, Go, or similar).

• Experience with containerization and orchestration technologies (Docker, Kubernetes).

• A solid understanding of distributed systems, microservices architectures, and their specific observability challenges.

• Familiarity with CI/CD pipelines and GitOps workflows.

• Exceptional troubleshooting abilities—you are the individual who perseveres until the root cause is identified.


🏝️ Benefits

• Competitive salary and equity package.

• Flexible working arrangements.

• Learning and development budget.

• Modern tech stack and the freedom to make a meaningful impact.

• A team that prioritizes quality execution over speed.

People also viewed

Arctiq18 hours ago

Site Reliability Engineer

US flagVirginia OnlyFreelanceDevOps & Site Reliability Engineer (SRE)
ApplyView job
Arctiq18 hours ago

Senior Site Reliability Engineer

US flagVirginia OnlyFreelanceDevOps & Site Reliability Engineer (SRE)
ApplyView job
Software Mind18 hours ago

Senior DevOps Manager, German speaking

PL flagPoland OnlyFull-timeDevOps & Site Reliability Engineer (SRE)
ApplyView job
Mediastream18 hours ago

DevOps Engineer

RO flagRomania OnlyFull-timeDevOps & Site Reliability Engineer (SRE)
ApplyView job
Kyndryl18 hours ago

Site Reliability Engineer

US flagOhio OnlyFull-timeDevOps & Site Reliability Engineer (SRE)$161.5k – $290.8k/year
ApplyView job
Guidehouse18 hours ago

Senior Azure DevOps Engineer

US flagUnited States OnlyFull-timeDevOps & Site Reliability Engineer (SRE)$118k – $196k/year
ApplyView job

Never miss a great job!

Get handpicked remote jobs straight to your inbox weekly.

Trusted by 7,400+ designers