Remotery

Software Engineer V – Infra/SRE

Posted May 6

This is a fully remote position, open to applicants in United States.

πŸ“‹ Description

β€’ Overseeing the development and execution of a thorough monitoring strategy for a high-availability application, balancing short-term operational demands with long-term sustainability goals.

β€’ Constructing and sustaining observability infrastructure utilizing Terraform, integrating AWS-native monitoring solutions with New Relic to ensure comprehensive full-stack visibility.

β€’ Collaborating closely with application engineers focused on TypeScript/JavaScript services operating on AWS ECS, along with RDS and EventBridge in the architecture β€” gaining a deep understanding of the application to effectively instrument it.

β€’ Setting reliability standards, creating runbooks, defining alerting thresholds, and establishing incident response practices that the wider team can manage and operate.

β€’ Leading and guiding a technical team, establishing direction, removing obstacles for others, and mentoring engineers through complex and high-pressure scenarios.

β€’ Engaging directly with government stakeholders to convey the reliability status of the application, highlight risks, and instill confidence in the systems for which you are accountable.


⛳️ Requirements

β€’ Over 10 years of engineering experience, with considerable time dedicated to SRE, platform, or infrastructure-centric roles.

β€’ Practical experience in building and managing infrastructure using Terraform within AWS environments.

β€’ Profound understanding of AWS observability tools and services, including hands-on experience with ECS, RDS, and EventBridge.

β€’ Proven experience in implementing and managing APM and monitoring solutions such as New Relic.

β€’ Capability to read, comprehend, and work alongside TypeScript/JavaScript application codebases β€” sufficient to instrument effectively and debug across the technology stack.

β€’ Experience operating systems that handle personally identifiable information (PII), demonstrating sound judgment regarding the operational and security practices required.

β€’ Proven track record of leading a technical team in a high-trust, fast-paced environment β€” setting direction, upholding standards, and fostering the development of team members.

β€’ Experience working with or alongside government agencies, possessing an understanding of the organizational dynamics and constraints involved.

β€’ Excellent communication skills for both technical and non-technical audiences, including the ability to translate complex reliability concepts for stakeholders lacking an engineering background.

β€’ A sense of curiosity, patience, and resilience when navigating ambiguous or rapidly evolving environments.

β€’ A Bachelor's degree (or equivalent experience) is contractually required for this position.


🏝️ Benefits

β€’ Offers Bonus

β€’ Profit sharing bonus available after 90 days

People also viewed

HealthEdge48 min ago

Senior Release Engineer

US flagUnited States OnlyFull-timeDevOps & Site Reliability Engineer (SRE)$108k – $115k/year
ApplyView job
Equinix1 hour ago

Senior Staff Engineer, SRE/DevOps, Produit Logiciel

US flagTexas OnlyFull-timeDevOps & Site Reliability Engineer (SRE)$136k – $204k/year
ApplyView job
Calendly1 hour ago

Senior Site Reliability Engineer

US flagUnited States OnlyFull-timeDevOps & Site Reliability Engineer (SRE)$198k – $288k/year
ApplyView job
GFT Technologies1 hour ago

DevOps Cloud Networking Engineer – English Advanced

BR flagBrazil OnlyFull-timeDevOps & Site Reliability Engineer (SRE)
ApplyView job
Hotel Engine1 hour ago

Senior Software Engineer, DevOps/Infrastructure

US flagUnited States OnlyFull-timeDevOps & Site Reliability Engineer (SRE)$121.4k – $168k/year
ApplyView job
Solace1 hour ago

Senior Cloud Site Reliability Engineer

IN flagIndia OnlyFull-timeDevOps & Site Reliability Engineer (SRE)
ApplyView job

Never miss a great job!

Get handpicked remote jobs straight to your inbox weekly.

Trusted by 7,400+ designers