
Site Reliability Engineer
Posted 5 days ago

This is a fully remote position, open to applicants in the United States.
• Implement observability solutions for applications and infrastructure to guarantee desired levels of availability, reliability, and performance.
• Participate in regular on-call rotations, and document incidents and their resolutions through post-mortem reports and routine review meetings.
• Collaborate proactively with Product and Engineering teams to identify, develop, deploy, and maintain dependable systems and services.
• Influence and create innovative designs, architectures, standards, and methodologies for large-scale systems.
• Maintain a high level of reliability for essential services and automated systems.
• Automate processes to enhance reliability, performance, and availability.
• Update technical documentation, workflows, and knowledge base articles regularly.
• Provide constructive feedback on pull requests and during peer coding reviews.
• Implement automated solutions that integrate Dynatrace, Azure DevOps, and Jira.
• Possess solid knowledge in specific areas of OneStream Software.
• Capable of mentoring others in various technical domains.
• Understand the practical application of SOC/FedRAMP controls to support Compliance and Security teams.
• Bachelor's degree in computer science, engineering, or a technology-related field (or equivalent work experience).
• Demonstrated experience as a Site Reliability Engineer or in a comparable role.
• Over 6 years of experience in cloud infrastructure and software development.
• At least 2 years of hands-on experience with Azure Kubernetes Service (AKS) and container-based deployments, or with comparable platforms such as OpenShift, GKE, or EKS.
• Advanced expertise in APM and observability tools such as Dynatrace, Application Insights, Datadog, Log Analytics, New Relic, Prometheus, and Grafana.
• Advanced knowledge of Infrastructure-as-Code (IaC) concepts and tools (Terraform, CloudFormation templates, Bicep, or ARM templates) on Microsoft Azure, Amazon Web Services (AWS), or Google Cloud Platform (GCP).
• In-depth knowledge of Configuration Management/Orchestration tools such as Ansible, PowerShell DSC, Chef, and Puppet.
• Comprehensive understanding of cloud concepts, including elasticity, security, and identity management.
• Familiarity with Agile Development methodologies using Jira or Azure DevOps Boards.
• Over 6 years of hands-on experience automating processes with PowerShell, Bash, command-line tools, REST APIs, Python, ARM templates, or other scripting languages.
• Proficient in using source control tools like Git, Azure DevOps, or GitHub.
• Knowledge of container orchestration platforms and tooling such as Kubernetes, OpenShift, AKS, GKE, or Helm.
• Experienced with Microsoft Azure, Amazon Web Services (AWS), or Google Cloud (GCP).
• Vision
• Medical
• Life
• Dental
• 401K