Remotery

Lead Engineer, DevOps – SRE

Posted 4 hours ago

📋 Description

• Take charge of and enhance Launch Potato's cloud infrastructure, CI/CD platform, and compliance framework.

• Establish the Site Reliability Engineering (SRE) function from the ground up, enabling product teams to accelerate shipping without sacrificing reliability, security, or budget control.

• Create the SRE practice from the ground up, including on-call rotation, PagerDuty setup, SLA/SLO definitions for essential infrastructure services, a runbook library, and observability dashboards that connect site performance to business metrics.

• Complete the AWS multi-account migration by transitioning production workloads to an isolated account with zero unplanned downtime.

• Produce a SOC 2 Type I audit-ready infrastructure evidence package, overseeing the technical controls implementation from start to finish.

• Version and publish the Terraform module library (30+ modules) to a private registry, eliminating ad hoc git usage by product teams.

• Implement automated deployment rollback for ECS and Lambda, ensuring that production is contingent upon the successful passage of integration tests.

• Establish monthly cost reporting for leadership, including budget anomaly detection, savings plan recommendations, and expenditure by service/team/environment.


⛳️ Requirements

• A minimum of 5 years of experience in production AWS infrastructure with substantial expertise in Terraform.

• Proven experience in building an SRE function from the ground up, with complete ownership of the process.

• Familiarity with a multi-site organization where PaaS or microservices are essential.

• Previous ownership of CI/CD pipelines in one or more roles.

• Experience with PagerDuty and establishing an on-call rotation.


🏝️ Benefits

• Profit-sharing bonus

• Competitive benefits

People also viewed

PandaDoc1 hour ago

DevOps Engineer

PT flagPortugal OnlyFull-timeDevOps & Site Reliability Engineer (SRE)
ApplyView job
PandaDoc1 hour ago

DevOps Engineer

UA flagUkraine OnlyFull-timeDevOps & Site Reliability Engineer (SRE)
ApplyView job
PandaDoc1 hour ago

DevOps Engineer

PL flagPoland OnlyFull-timeDevOps & Site Reliability Engineer (SRE)
ApplyView job
PandaDoc1 hour ago

DevOps Engineer

ES flagSpain OnlyFull-timeDevOps & Site Reliability Engineer (SRE)
ApplyView job
SupplyHouse.com1 hour ago

Site Reliability Engineer

IN flagIndia OnlyFull-timeDevOps & Site Reliability Engineer (SRE)$29k – $36k/year
ApplyView job
BI2run4 hours ago

BI DevOps Engineer – m/w/d

DE flagGermany OnlyFull-timeDevOps & Site Reliability Engineer (SRE)€50k – €70k/year
ApplyView job

Never miss a great job!

Get handpicked remote jobs straight to your inbox weekly.

Trusted by 7,400+ designers