Remotery

Senior Site Reliability Engineer

Posted 6 days ago

This is a fully remote position, open to applicants in Latin America.

📋 Description

• Design, develop, and advance cloud infrastructure architectures ensuring high availability, reliability, security, and scalability.

• Establish and uphold reference architectures and patterns for services, applications, and environments throughout the organization.

• Create workflow processes and standards for building, deploying, and managing applications within a distributed architecture.

• Spearhead infrastructure modernization projects (e.g., containerization, Kubernetes integration, infrastructure as code, platform consolidation).

• Set and enforce governance standards for infrastructure, CI/CD, observability, and operational practices.

• Develop and maintain policies for environment management, access control, configuration management, and change management.

• Implement cost management strategies (e.g., tagging, budget alerts, rightsizing, reservations/committed use, auto-scaling policies) to optimize cloud expenditures.

• Collaborate with product and engineering leadership to balance performance, reliability, and cost-effectiveness across environments.

• Utilize DORA metrics and industry benchmarks to foster continuous improvement in delivery and operational performance.

• Design, implement, and oversee CI/CD pipelines for multiple applications and environments utilizing tools such as Git, Azure DevOps, GitLab, or Jenkins.

• Develop and manage automation pipelines for deployment, configuration, and infrastructure oversight.

• Construct and maintain monitoring, alerting, and logging systems to ensure visibility, high availability, and performance of applications and services.

• Oversee cloud infrastructure resources and services to guarantee reliability, security, and scalability.

• Direct incident response efforts, including triage, root cause analysis, and post-incident evaluations.

• Contribute to and maintain incident response processes, documentation, and on-call practices.

• Collaborate with engineering teams to design resilient systems and decrease mean time to recovery (MTTR).

• Work alongside software engineering, QA, product, and IT teams to determine optimal solutions for complex infrastructure, security, and delivery issues.

• Mentor engineers in DevOps and platform practices, tools, and standards across the organization.

• Lead departmental initiatives related to DevOps, platform engineering, and infrastructure disciplines; present plans and progress to stakeholders.

• Propel new departmental initiatives based on organizational needs and your expertise in modern technologies and industry trends.

• Stay informed about emerging technologies, tools, and best practices; assess their potential integration within our tech stack.


⛳️ Requirements

• BS or MS in Computer Science, Engineering, or a related technical field, or equivalent practical experience.

• 6+ years of experience with container orchestration services (Kubernetes preferred).

• 6+ years of experience administering and deploying CI/CD tools (e.g., Git, Azure DevOps, Jira, GitLab, Jenkins).

• 6+ years of experience managing scalable applications in one or more major cloud environments.

• 8+ years of substantial experience with both Windows and Linux operating system environments.

• 7+ years of experience with scripting and automation using tools such as PowerShell, Bash, or Python.

• 4+ years of experience with infrastructure-as-code and orchestration platforms (e.g., Terraform, ARM/Bicep, CloudFormation, Ansible, etc.).

• Proven expertise in designing architectures for scalable, reliable, and secure tech stacks within distributed systems.

• Proven experience in implementing workflow processes for operating and maintaining applications in distributed architectures.


🏝️ Benefits

• Comprehensive health, dental, and vision insurance.

• Generous paid time off and flexible work arrangements.

• Opportunities for professional development and continuous learning.

• Collaborative and innovative work environment.

People also viewed

Experian40 min ago

SRE Specialist

BR flagBrazil OnlyFull-timeDevOps & Site Reliability Engineer (SRE)
ApplyView job
In All Media40 min ago

Azure DevOps Engineer, ML Ops Engineer

Latin AmericaFreelanceDevOps & Site Reliability Engineer (SRE)
ApplyView job
knowmad mood40 min ago

Backend Developer, PHP, React, DevOps

ES flagSpain OnlyFull-timeDevOps & Site Reliability Engineer (SRE)
ApplyView job
Work Life Group1 hour ago

Lead DevOps Engineer, Data & AI Platform

HU flagHungary OnlyFull-timeDevOps & Site Reliability Engineer (SRE)
ApplyView job
accesa.eu1 hour ago

DevOps Engineer, German

RO flagRomania OnlyFull-timeDevOps & Site Reliability Engineer (SRE)
ApplyView job
Cisco1 hour ago

Site Reliability Engineer – Kubernetes Platform

IN flagIndia OnlyFull-timeDevOps & Site Reliability Engineer (SRE)
ApplyView job

Never miss a great job!

Get handpicked remote jobs straight to your inbox weekly.

Trusted by 7,400+ designers