
Senior Site Reliability Engineer
Posted May 23

This is a fully remote position, open to applicants in Spain.
• Create, develop, and manage dependable and scalable systems by establishing and tracking SLOs/SLIs.
• Proactively implement automation for infrastructure and operational processes to minimize manual tasks and decrease MTTR.
• Regularly assess and improve system performance and cost efficiency, providing data, insights, and recommendations for capacity planning.
• Support security best practices through hands-on vulnerability mitigation and threat management.
• SRE & Cloud Engineering: Practical experience with SRE methodologies in a production environment, extensive AWS knowledge, Kubernetes, networking, DNS, and Infrastructure as Code (Pulumi preferred, Terraform is a plus).
• Automation & Software Engineering: Strong software engineering principles with a focus on code quality and maintainability, including solid proficiency in Python and a deep understanding of the Python ecosystem (testing, debugging, and packaging).
• Reliability, Data & Operations: Lead incident response and RCAs, improve system reliability, collaborate with stakeholders to propose solutions and share insights, and mentor others.
• Work Your Way: Enjoy full flexibility – work from home, the office, or a combination of both.
• Work remotely from anywhere for up to 30 days each year.
• Access to learning resources, mentorship, and a personalized growth plan tailored to your career aspirations.
• Benefit from private healthcare, gym discounts, wellbeing programs, and mental health support.