
Engineering Manager, DevOps
Posted 22 hours ago

This is a fully remote position open to applicants in the United States.
• Lead and cultivate a DevOps team: Recruit, mentor, and develop engineers; establish clear expectations and foster an environment of ownership, high standards, and continuous improvement.
• Enhance platform reliability: Take responsibility for the health of CI/CD, deployment, and runtime infrastructure; boost availability, performance, and incident response through measurable SLOs and operational rigor.
• Foster self-service and automation: Develop developer-oriented tools to minimize toil (golden paths, paved roads, templates, and automation for common workflows).
• Advance CI/CD and release engineering: Refine build and deploy pipelines, change management, release safety (progressive delivery, rollbacks), and supply-chain security.
• Implement observability and monitoring: Establish and enhance logging, metrics, and alerting; create dashboards and guardrails that assist teams in understanding and improving system behavior.
• Infrastructure as code: Standardize and scale infrastructure management utilizing modern IaC patterns and review/approval workflows.
• Collaborate on security and compliance: Partner closely with security and compliance stakeholders to ensure secure-by-default systems and audit-ready practices.
• Foster cross-functional collaboration: Collaborate with application teams to enhance deployability and operability, translating business priorities into an actionable roadmap.
• Experience in engineering management: Minimum of 3 years leading and developing software engineers, including coaching, performance management, recruitment, and fostering healthy team practices.
• Extensive hands-on experience: Over 8 years of experience as an individual contributor in DevOps/SRE/platform/infrastructure or software engineering roles.
• Expertise in cloud and systems: Strong experience managing production systems in a major cloud environment (AWS preferred) and ensuring reliability and scalability.
• Proficiency in CI/CD and automation: Demonstrated experience in designing and operating modern build and deploy pipelines and automating operational workflows.
• Knowledge of observability: Experience with logging/metrics/alerting stacks (e.g., Datadog, Grafana, CloudWatch, ELK/OpenSearch) and leveraging them for reliability enhancements.
• Infrastructure as code experience: Familiarity with Terraform, Pulumi, or similar tools and associated engineering practices (code review, testing, drift detection).
• Understanding of containers and runtime: Knowledge of containerization and orchestration (Docker; Kubernetes/ECS preferred).
• Strong communication skills: Ability to align stakeholders, articulate trade-offs, and drive execution across teams and functions.
• Health, dental, and vision insurance
• 100% remote-first culture: work from anywhere in the US, with WeWork access for all full-time employees
• Unlimited PTO, including competitive vacation and holiday schedules
• Lifestyle stipends: monthly mental health, wellness, and fitness stipend; in-home office setup stipend; and family planning assistance
• Salary top-up during military reserve duty
• Fully paid parental leave
• Child and pet care reimbursement during travel