
Principal DevOps Engineer
Posted 1 day ago

Posted 1 day ago
This is a fully remote position, open to applicants in United States.
• Establish the long-term technical vision for our cloud platform, providing guidance to leadership on architectural strategy, investment priorities, and systemic risk management.
• Set organizational benchmarks for cloud reliability, observability, and operational excellence, encompassing SLO/SLI frameworks, incident management protocols, and the foundational tooling that supports them.
• Act as the primary solutions engineering authority for partner teams, converting cross-functional requirements into platform strategies and prioritizing investments in developer experience.
• Take ownership of the developer experience roadmap by identifying and addressing systemic gaps in self-service infrastructure, CI/CD workflows, and operational visibility throughout engineering.
• Proactively identify systemic risks within our AWS infrastructure, IaC practices, and delivery pipelines, driving organizational initiatives before potential issues escalate into incidents.
• Convert complex infrastructure risks and platform strategies into clear, business-aligned narratives for Director-level and executive stakeholders to facilitate resourcing and prioritization decisions.
• Mentor Staff (T4) and Senior (T3) engineers, exemplifying the culture of engineering rigor, documentation, and cross-functional accountability that is expected across the organization.
• Bachelor’s Degree in Computer Science, Engineering, or a related field.
• Over 8 years of experience in cloud platform or infrastructure engineering with a proven track record of technical impact at the organizational level.
• In-depth AWS expertise across compute (EC2, ECS, Lambda), networking (VPCs, Transit Gateway, Route 53, CloudFront), data (RDS, S3), and security (IAM, Secrets Manager, Systems Manager), with a history of making high-stakes architectural decisions.
• Extensive knowledge of cloud reliability and operational excellence, including SLO/SLI design, centralized observability, alerting strategies, and incident lifecycle management at production scale.
• Expert-level experience with Terraform and IaC platform leadership, including organizational module strategy, policy-as-code, and self-service provisioning standards that can scale across teams with diverse infrastructure maturity.
• Demonstrated ability as a solutions engineering partner and advocate for developer experience, collaborating with teams to identify needs, prioritize initiatives, and deliver platform capabilities that minimize toil and enhance engineering velocity.
• Exceptional communication skills at the executive level, capable of translating infrastructure risks and platform strategies into business-aligned narratives for Director and VP audiences, fostering alignment without formal authority.
• Proven history of mentoring Staff and Senior engineers, establishing organization-wide engineering standards, and fostering a culture of technical accountability and documentation rigor.
• Ability to translate business objectives and organizational risks into multi-year platform roadmaps, sequencing initiatives based on strategic impact and advocating for long-term technical investments.
• Experience in establishing CI/CD governance models and delivery philosophies at an organizational level, defining standards and self-service patterns that teams can build upon, without directly managing pipeline implementations.
• Comprehensive healthcare coverage.
• Stock awards.
• Monthly wellness stipend.
• Retirement savings match.
• Lifetime Headspace membership.
• Generous parental leave.
EXL
Headspace
Allstate
Sargent & Lundy
Get handpicked remote jobs straight to your inbox weekly.