
Site Reliability Engineer
Posted 18 hours ago

• Join our team as a Site Reliability Engineer (SRE) and play a key role in ensuring the reliability, resiliency, and innovation of our information systems and ecosystems.
• Assess business requirements, address intricate challenges, and deliver strategic insights and designs.
• Participate in all phases of the software development lifecycle, from creation and testing to deploying changes and maintaining robust systems.
• Cultivate trusted relationships with customers and collaborate with them to achieve success.
• Work on comprehensive end-to-end services that span customer sites and platforms.
• Collaborate proactively with a skilled team of professionals.
• Over 10 years of experience in operational management, including incident management and escalation procedures.
• Proven experience in designing and implementing application monitoring solutions to ensure reliability and performance that meets or exceeds business objectives.
• History of implementing strategies to manage operational load and handle overflow using suitable tools and metrics; defining service level indicators and objectives in partnership with stakeholders, business units, development, DevSecOps, and operations teams.
• Expertise in solution design within an enterprise environment: Windows Server, Linux server (preferably RHEL), and UNIX (AIX, Solaris).
• Proficient in Windows Server and storage, with experience on cloud platforms such as AWS, Azure, Google Cloud Platform (GCP), or OpenShift.
• Familiarity with data formats and scripting languages including JSON, YAML, Bash, and/or PowerShell.
• Medical and dental coverage
• Disability
• Retirement benefits
• Paid leave
• Paid time off
Arctiq