
Manager, DevOps Engineering – Observability, Developer Experience
Posted 3 hours ago

• Lead and engage with a team of DevOps/SRE engineers dedicated to Observability (monitoring, logging, alerting) and Developer Experience (CI/CD pipelines, internal developer tools).
• Assist in defining the roadmap and vision for observability and developer productivity, ensuring alignment with business and engineering objectives.
• Design, implement, and manage monitoring and logging infrastructure to maintain comprehensive visibility into system health and performance.
• Develop and sustain CI/CD pipelines and automation frameworks that facilitate rapid, secure deployments and enhance developer workflows.
• Collaborate with various engineering and product teams to promote cross-functional initiatives, ensuring reliability and efficiency are integral to our services.
• Establish and disseminate best practices for infrastructure management, incident response, and post-mortem analysis.
• Contribute to the definition and implementation of AI tooling and standards throughout the organization, ensuring developers have access to scalable and secure AI platforms.
• Over 5 years of experience in DevOps, SRE, or Infrastructure Engineering, including some background in leading or mentoring fellow engineers.
• Hands-on proficiency in observability tools and practices (e.g., Prometheus, Grafana, ELK stack) and an understanding of SRE principles.
• Experience with CI/CD pipelines and automation tools (e.g., Jenkins, GitLab CI) along with Infrastructure as Code methodologies.
• Knowledge of cloud services (AWS and/or GCP) and container orchestration (Kubernetes), with experience in constructing scalable systems.
• Excellent collaboration and communication abilities, with a track record of working across teams (development, QA, product).
• A strong enthusiasm for automation, reliability, and developer efficiency, coupled with a continuous improvement mindset.
Preferred (Not Essential):
• Experience in leading DevOps/SRE transformations or implementing reliability engineering practices on a large scale.
• Prior software engineering experience.
• Comprehensive health and wellness programs.
• Opportunities for professional development and career advancement.
• Flexible working arrangements and a supportive work environment.
• Access to cutting-edge tools and technologies.