Remotery

Site Reliability Engineer

Posted 5 days ago

This is a fully remote position, open to applicants in the United States.

📋 Description

• Implement observability solutions for applications and infrastructure to guarantee desired levels of availability, reliability, and performance.

• Take part in regular on-call rotations, and document incidents and their resolutions through post-mortem reports and routine review meetings.

• Collaborate proactively with Product and Engineering teams to identify, develop, deploy, and maintain dependable systems and services.

• Influence and create innovative designs, architectures, standards, and methodologies for large-scale systems.

• Maintain a high level of reliability for essential services and automated systems.

• Automate processes to enhance reliability, performance, and availability.

• Update technical documentation, workflows, and knowledge base articles regularly.

• Provide constructive feedback on pull requests and during peer coding reviews.

• Implement automated solutions that integrate Dynatrace, Azure DevOps, and Jira.

• Possess solid knowledge in specific areas of OneStream Software.

• Mentor others in various technical domains.

• Understand the practical application of SOC/FedRAMP controls to support Compliance and Security teams.


⛳️ Requirements

• Bachelor's degree in computer science, engineering, or a technology-related field (or equivalent work experience).

• Demonstrated experience as a Site Reliability Engineer or in a comparable role.

• Over 6 years of experience in cloud infrastructure and software development.

• At least 2 years of hands-on experience with Azure Kubernetes Service (AKS) and container-based deployments, or with comparable platforms such as OpenShift, GKE, or EKS.

• Advanced expertise in APM and observability tools such as Dynatrace, Application Insights, Datadog, Log Analytics, New Relic, Prometheus, and Grafana.

• Advanced knowledge of Infrastructure-as-Code (IaC) concepts and tools (Terraform, CloudFormation templates, Bicep, or ARM templates) on Microsoft Azure, Amazon Web Services (AWS), or Google Cloud Platform (GCP).

• In-depth knowledge of Configuration Management/Orchestration tools such as Ansible, PowerShell DSC, Chef, and Puppet.

• Comprehensive understanding of cloud concepts, including elasticity, security, and identity management.

• Familiarity with Agile Development methodologies using Jira or Azure DevOps Boards.

• Over 6 years of hands-on experience automating processes with PowerShell, Bash, command-line tools, REST APIs, Python, ARM templates, or other scripting languages.

• Proficient in using source control tools like Git, Azure DevOps, or GitHub.

• Knowledge of container orchestration platforms such as Kubernetes, OpenShift, AKS, or GKE, and of related tooling such as Helm.

• Experience with Microsoft Azure, Amazon Web Services (AWS), or Google Cloud Platform (GCP).


🏝️ Benefits

• Vision

• Medical

• Life

• Dental

• 401(k)
