
Site Reliability Engineer
Posted May 20

This is a fully remote position, open to applicants in Brazil.
• Design, develop, and enhance CI/CD pipelines for applications and infrastructure.
• Create automation frameworks that minimize manual work and enhance consistency.
• Configure and optimize cloud infrastructure to adhere to best practices for security, scalability, and performance.
• Work alongside development teams to eliminate deployment obstacles and enhance delivery processes.
• Monitor reliability and performance, proactively identify issues, and implement data-driven enhancements to boost uptime and efficiency.
• Participate in on-call rotations and lead incident resolution, providing clear postmortems and preventive measures.
• Maintain comprehensive technical documentation for pipelines, configurations, and runbooks.
• Conduct readiness assessments and validation tests prior to production deployments.
• Implement Infrastructure as Code using Terraform and ARM templates, keeping all definitions under version control and deployments reproducible.
• Diagnose complex deployment, provisioning, and performance challenges across multicloud and containerized environments.
• 3 to 5 years of experience in SRE or DevOps roles managing production systems.
• Hands-on experience with production workloads on Kubernetes in a cloud setting, including cluster design, autoscaling, upgrades, and network policies.
• Demonstrated CI/CD delivery proficiency using GitHub Actions or Jenkins, including environment promotion, approvals, and rollback strategies.
• Expertise in Infrastructure as Code with Terraform and ARM templates, covering modules, remote state, workspaces, and policy enforcement.
• Strong scripting skills in PowerShell, Bash, or Python for automation and diagnostics.
• Experience with GitOps using Argo CD or Flux, including multi-environment application delivery and drift remediation.
• Familiarity with containerization using Docker and Kubernetes, including health probes, PodDisruptionBudgets, resource quotas, HorizontalPodAutoscaler, and operators.
• Understanding of networking fundamentals and cloud network security practices such as VNet design, NSGs, Private Link, and ingress controllers.
• Knowledge of cloud security and compliance, including least privilege, secrets management, audit trails, and control evidence.
• Excellent written and verbal communication skills in English.
• Ability to collaborate effectively across US time zones.
Advanced Solutions International, Inc.