
Site Reliability Engineer
Posted May 11

Posted May 11
This is a fully remote position, open to applicants in Philippines.
• Design, develop, and sustain scalable and dependable infrastructure and platform services.
• Create and uphold infrastructure-as-code solutions (e.g., CloudFormation, Terraform).
• Build custom automation workflows and internal tools to facilitate infrastructure provisioning, monitoring, and incident management.
• Oversee system performance, availability, and capacity utilizing observability tools (e.g., SumoLogic, AWS CloudWatch).
• Generate and maintain dashboards and monitoring solutions that provide comprehensive insights into platform health, aiding in swift incident diagnosis.
• Streamline operational processes (e.g., deployments, failovers, scaling) to minimize manual effort and enhance system resilience.
• Engage in incident response activities, including postmortem analyses and root cause investigations, to promote continuous improvement.
• Continuously advance and uphold Service Level Objectives (SLOs) and Service Level Indicators (SLIs), ensuring an equilibrium between development pace and system reliability.
• Design and execute robust Continuous Integration/Continuous Deployment (CI/CD) pipelines and strategies for zero-downtime deployments.
• Collaborate with engineering teams to integrate reliability, scalability, performance, and security best practices throughout the Software Development Life Cycle (SDLC).
• A minimum of 2 years of experience in a Site Reliability Engineering (SRE) role or a comparable position (e.g., DevOps Engineer).
• Proven experience managing an AWS environment and operating within a SaaS business model.
• Solid understanding and hands-on experience with infrastructure-as-code.
• Experience in constructing and supporting robust CI/CD pipelines.
• Strong problem-solving and analytical capabilities.
• Excellent communication and teamwork skills.
• Ability to thrive in a fast-paced, agile setting.
• Familiarity with BuildKite.
• Experience with distributed systems and microservices architecture.
• Exposure to compliance frameworks such as PCI-DSS and ISO27001.
• Health insurance.
• Paid time off.
• Opportunities for professional development.
Innovative Solutions
Caspar Health
IVIX
Investigo
Get handpicked remote jobs straight to your inbox weekly.