
Site Reliability Engineer
Posted 5 days ago

Posted 5 days ago
• Participate in and enhance the entire lifecycle of services - from initial concept and design to deployment, operation, and refinement.
• Assist in the development of services from the planning stage prior to their launch through activities such as system design consulting, software platform and framework development, capacity planning, and launch reviews.
• Offer technical leadership and direction to team members regarding the management of availability and performance of mission-critical services, automation to prevent recurring issues, and automated responses for standard service conditions.
• Oversee services once they are operational by measuring and monitoring availability, latency, and overall system health.
• Sustainably scale systems using mechanisms like automation, and promote changes that enhance reliability and velocity.
• Plan for the growth of cloud infrastructure capacity.
• Enhance operational processes including deployments and upgrades.
• Manage the execution of project priorities, deadlines, and deliverables.
• Participate in an on-call rotation to address incidents affecting platform availability.
• Utilize your on-call shifts to mitigate the occurrence of incidents.
• Experience in incident response, including conducting post-mortems and applying lessons learned, contributes to improved system reliability.
• Over 10 years of experience in engineering or systems.
• Proficiency in leveraging cloud architecture, applying site reliability principles, and/or showing awareness of operational concerns.
• Strong knowledge of network design and architecture.
• Experience in scaling and managing distributed systems.
• Extensive experience with monitoring and observability platforms.
• Proven ability to debug, fix, and optimize code.
• Troubleshooting expertise across network, application, and distributed services layers.
• The capacity to learn rapidly and adapt to new technologies is crucial.
• Outstanding communication skills, both verbal and written.
• Health & Wellbeing
• Personal & Professional Development
• Unconditional Inclusion
Arctiq
Arctiq
Software Mind
Mediastream
Get handpicked remote jobs straight to your inbox weekly.