
Software Engineer V - Infra/SRE
Posted May 6

This is a fully remote position, open to applicants in the United States.
• Overseeing the development and execution of a thorough monitoring strategy for a high-availability application, balancing short-term operational demands with long-term sustainability goals.
• Constructing and sustaining observability infrastructure utilizing Terraform, integrating AWS-native monitoring solutions with New Relic to ensure comprehensive full-stack visibility.
• Collaborating closely with application engineers focused on TypeScript/JavaScript services operating on AWS ECS, along with RDS and EventBridge in the architecture, gaining a deep understanding of the application to effectively instrument it.
• Setting reliability standards, creating runbooks, defining alerting thresholds, and establishing incident response practices that the wider team can manage and operate.
• Leading and guiding a technical team, establishing direction, removing obstacles for others, and mentoring engineers through complex and high-pressure scenarios.
• Engaging directly with government stakeholders to convey the reliability status of the application, highlight risks, and instill confidence in the systems for which you are accountable.
• Over 10 years of engineering experience, with considerable time dedicated to SRE, platform, or infrastructure-centric roles.
• Practical experience in building and managing infrastructure using Terraform within AWS environments.
• Profound understanding of AWS observability tools and services, including hands-on experience with ECS, RDS, and EventBridge.
• Proven experience in implementing and managing APM and monitoring solutions such as New Relic.
• Capability to read, comprehend, and work alongside TypeScript/JavaScript application codebases, sufficient to instrument effectively and debug across the technology stack.
• Experience operating systems that handle personally identifiable information (PII), demonstrating sound judgment regarding the operational and security practices required.
• Proven track record of leading a technical team in a high-trust, fast-paced environment: setting direction, upholding standards, and fostering the development of team members.
• Experience working with or alongside government agencies, possessing an understanding of the organizational dynamics and constraints involved.
• Excellent communication skills for both technical and non-technical audiences, including the ability to translate complex reliability concepts for stakeholders lacking an engineering background.
• A sense of curiosity, patience, and resilience when navigating ambiguous or rapidly evolving environments.
• A Bachelor's degree (or equivalent experience) is contractually required for this position.
• Offers Bonus
• Profit sharing bonus available after 90 days