Remotery

Site Reliability Engineer

Posted May 20

This is a fully remote position, open to applicants in Brazil.

📋 Description

• Design, develop, and enhance CI/CD pipelines for applications and infrastructure.

• Create automation frameworks that minimize manual work and enhance consistency.

• Configure and optimize cloud infrastructure to adhere to best practices for security, scalability, and performance.

• Work alongside development teams to eliminate deployment obstacles and enhance delivery processes.

• Monitor reliability and performance, proactively identify issues, and implement data-driven enhancements to boost uptime and efficiency.

• Participate in on-call rotations and lead incident resolution, providing clear postmortems and preventive measures.

• Maintain comprehensive technical documentation for pipelines, configurations, and runbooks.

• Conduct readiness assessments and validation tests prior to production deployments.

• Implement Infrastructure as Code utilizing Terraform and ARM templates, ensuring version control and reproducibility.

• Diagnose complex deployment, provisioning, and performance challenges across multicloud and containerized environments.


⛳️ Requirements

• 3 to 5 years of experience in SRE or DevOps roles managing production systems.

• Hands-on experience with production workloads on Kubernetes in a cloud setting, including cluster design, autoscaling, upgrades, and network policies.

• Demonstrated CI/CD delivery proficiency using GitHub Actions or Jenkins, including environment promotion, approvals, and rollback strategies.

• Expertise in Infrastructure as Code with Terraform and ARM templates, covering modules, remote state, workspaces, and policy enforcement.

• Strong scripting skills in PowerShell, Bash, or Python for automation and diagnostics.

• Experience with GitOps utilizing Argo CD or Flux, overseeing multi-environment application delivery and drift remediation.

• Familiarity with containerization using Docker and Kubernetes, including health probes, PodDisruptionBudgets, resource quotas, HorizontalPodAutoscaler, and operators.

• Understanding of networking fundamentals and cloud network security practices such as VNet design, NSGs, Private Link, and ingress controllers.

• Knowledge of cloud security and compliance, including least privilege, secrets management, audit trails, and control evidence.

• Excellent written and verbal communication skills in English.

• Ability to collaborate effectively across US time zones.


🏝️ Benefits

• [Add benefits details here]

• [Add additional benefits details here]

People also viewed

Advanced Solutions International, Inc.11 hours ago

DevOps Reliability Engineer

AU flagAustralia OnlyFull-timeDevOps & Site Reliability Engineer (SRE)$90k – $110k/year
ApplyView job
Stone11 hours ago

Senior Site Reliability Engineer – Network

BR flagBrazil OnlyFull-timeDevOps & Site Reliability Engineer (SRE)
ApplyView job
Replit1 day ago

Staff Site Reliability Engineer

EuropeFull-timeDevOps & Site Reliability Engineer (SRE)
ApplyView job
Soum1 day ago

DevOps Engineer, Mid Level

EG flagEgypt OnlyFull-timeDevOps & Site Reliability Engineer (SRE)
ApplyView job
Lakeside Software1 day ago

DevOps Engineer, Azure

IN flagIndia OnlyFull-timeDevOps & Site Reliability Engineer (SRE)
ApplyView job
Interval Group1 day ago

DevOps Engineer, mk8s

DE flagGermany OnlyFreelanceDevOps & Site Reliability Engineer (SRE)
ApplyView job

Never miss a great job!

Get handpicked remote jobs straight to your inbox weekly.

Trusted by 7,400+ designers