
Engineering Lead, Platform Engineering
Posted 20 hours ago

This is a fully remote position, open to applicants in Virginia.
Responsibilities
• Lead a team of engineers, taking full responsibility for their performance, development, and overall wellbeing.
• Conduct 1:1 meetings, performance evaluations, and career development discussions, offering direct, constructive feedback that helps engineers grow.
• Foster a team culture that emphasizes organization, reliability, and impactful outcomes.
• Guide engineers through code reviews, collaborative work, and coaching on delivery practices.
• Act as a significant technical contributor: share the engineering workload with the team and consistently deliver on high-impact projects.
• Convert architectural vision into actionable sprint-level tasks for the team.
• Develop and enhance CI/CD pipelines and delivery automation to ensure safe, consistent, and rapid deployments.
• Enhance observability and operational readiness through metrics, logging, distributed tracing, and alerting, including actionable dashboards and SLO-based alerts.
• Design and implement automation and self-service workflows utilizing infrastructure-as-code, APIs, and developer platforms to minimize developer friction.
• Coordinate incident response: manage the process, communicate updates, and ensure that issues reach the appropriate personnel.
• Participate in the on-call rotation, driving improvements to minimize incidents and reduce alert noise over time.
• Create and maintain operational runbooks, escalation processes, and documentation for systems owned by the team.
• Promote production readiness as an ongoing standard rather than a one-time checklist.
Requirements
• 7+ years of practical experience in cloud infrastructure/platform engineering, demonstrating growth in scope and leadership responsibility over that period.
• Extensive hands-on experience with Terraform in a production environment (including modules, patterns, environment strategy, and state management).
• Solid hands-on experience operating Kubernetes in a production setting.
• Strong foundational knowledge of AWS: practical experience with compute, networking, IAM, and production operations.
• Proven experience in building and maintaining CI/CD pipelines.
• Strong fundamentals in observability, including metrics, logging, distributed tracing, SLO/SLI design, and alerting strategies.
• Experience in creating automation using Bash and at least one general-purpose programming language (Python or Go).
• Excellent troubleshooting skills: you conduct root cause analysis and implement permanent solutions rather than temporary fixes.
• Hands-on experience with AI coding tools (e.g., GitHub Copilot, Cursor, Claude) as productivity enhancers in production engineering tasks.
• Ability to collaborate effectively with senior technical peers, with sound judgment about when to consult others and when to decide independently.
Benefits
• Comprehensive medical, dental, and vision insurance.
• 20 days of paid time off (PTO) plus 11 paid holidays.
• 401(k) retirement plan with company matching.
• Student loan and tuition reimbursement programs.
• Commuter assistance benefits.
• Maternity and paternity leave.
• Inclusion and associate engagement initiatives.