
Site Reliability Engineer
Posted 21 hours ago

This is a fully remote position, open to applicants in the United States.
• Develop and implement a comprehensive infrastructure strategy that proactively addresses changing business needs and promotes operational excellence.
• Take ownership of the reliable delivery of complex technical solutions through extensive automation utilizing Kubernetes and advanced CI/CD pipelines.
• Ensure high portal availability and system health by applying cutting-edge observability and distributed tracing techniques.
• Lead high-severity incident response initiatives and foster systemic enhancements through thoughtful, blameless postmortem evaluations.
• Design failure-resilient and self-healing infrastructure systems to guarantee continuous operational stability and prevent data loss.
• Act as the internal subject matter expert, guiding software architecture decisions toward optimal scalability and performance.
• Conduct regular knowledge-sharing and training sessions to uplift technical standards and process reliability across the entire technology department.
• Oversee security initiatives and devise secure networking strategies to uphold a high-standard protection framework for all client data and assets.
• 4–7 years of professional experience in constructing and managing resilient, modern infrastructure within a dynamic environment.
• Expert-level skills in managing and troubleshooting Linux-based servers across various distributions.
• Advanced expertise in creating modular, reusable infrastructure templates utilizing tools such as Terraform and Ansible.
• Proven track record of managing containerized workloads at scale using Kubernetes and Helm.
• Extensive experience in configuring and optimizing high-performance database environments, particularly MySQL.
• Demonstrated capability to develop robust, secure CI/CD deployment pipelines that incorporate automated rollback and quality gates.
• Strong technical documentation abilities, encompassing the creation of architectural diagrams, detailed specifications, and operational playbooks.
• Capacity to lead cross-functional projects independently while mentoring junior engineers and driving team-wide initiatives.
• Health benefits
• Flexible paid time off
• Parental leave
• Fertility and adoption assistance
• 401(k)
• Educational reimbursement