
Site Reliability Engineer
Posted Jun 12

Posted Jun 12
This is a fully remote position, open to applicants in Malaysia.
• Ensure comprehensive global coverage of our products.
• Accountable for providing all engineering teams with an exceptional development experience.
• Lead the implementation of Docker/Kubernetes across all engineering workstations.
• Guarantee that a high-quality observability setup is established.
• Deliver engineering support for all products.
• Collaborate with the Architecture team to shape product development direction.
• Engage in post-incident analysis reviews.
• Ensure all engineering teams enjoy a top-tier development experience through tooling, scripts, and support.
• Drive the implementation of Docker/Kubernetes across all engineering workstations, regardless of the operating system.
• Collaborate with the SRE team and other Architects to minimize discrepancies between workstations and cloud environments, and develop workarounds and solutions when necessary.
• Ensure that an excellent observability setup is in place.
• Serve as the go-to expert within the organization for all matters related to Docker and Containers.
• Maintain an active and up-to-date knowledge base and setup guides.
• Provide engineering support for all products.
• Work alongside the Architecture team to understand the anticipated direction of product development.
• Offer training, support, and resources to engineering teams.
• Inform the engineering team of any necessary code changes to support other cloud-based PaaS products.
• Assist the IT manager with device procurement for engineering teams.
• Supply QA teams with additional tooling and support as needed.
• Help the engineering team resolve workstation setup issues.
• Participate in product scrums as needed.
• Collaborate with the engineering team during code reviews.
• Ensure that vendor dependencies are documented and scoped.
• Address support escalation issues.
• Develop software/scripts to automate tasks and facilitate the work of engineering, operations, and support teams.
• Take part in post-incident reviews.
• Enhance the on-call process by reducing team burden while improving response times to issues.
• Participate in knowledge transfer sessions to enable the wider team to self-serve.
• Capture, analyze, and update relevant metrics (SLI, SLO, SLA).
• Create monitoring solutions to enhance availability and identify anomalies.
• We Are a Home-First Team: LineTen is dedicated to our home-first policy, prioritizing remote work while offering office spaces in London, England, and Porto, Portugal.
• We Believe in Having Fun: Our WellUs team organizes monthly events such as Pet Zoom Calls, 45-minute Yoga sessions, and after-hours cocktail classes.
• We Want You to Take a Break: We emphasize the quality of work over the number of hours logged. We provide flexible working hours and unlimited vacation time.
Advanced Solutions International, Inc.
Stone
Replit
Soum
Get handpicked remote jobs straight to your inbox weekly.