This is a fully remote position, open to applicants in Latin America.

📋 Description

• Design, develop, and advance cloud infrastructure architectures ensuring high availability, reliability, security, and scalability.

• Establish and uphold reference architectures and patterns for services, applications, and environments throughout the organization.

• Create workflow processes and standards for building, deploying, and managing applications within a distributed architecture.

• Spearhead infrastructure modernization projects (e.g., containerization, Kubernetes integration, infrastructure as code, platform consolidation).

• Set and enforce governance standards for infrastructure, CI/CD, observability, and operational practices.

• Develop and maintain policies for environment management, access control, configuration management, and change management.

• Implement cost management strategies (e.g., tagging, budget alerts, rightsizing, reservations/committed use, auto-scaling policies) to optimize cloud expenditures.

• Collaborate with product and engineering leadership to balance performance, reliability, and cost-effectiveness across environments.

• Utilize DORA metrics and industry benchmarks to foster continuous improvement in delivery and operational performance.

• Design, implement, and oversee CI/CD pipelines for multiple applications and environments utilizing tools such as Git, Azure DevOps, GitLab, or Jenkins.

• Develop and manage automation pipelines for deployment, configuration, and infrastructure oversight.

• Construct and maintain monitoring, alerting, and logging systems to ensure visibility, high availability, and performance of applications and services.

• Oversee cloud infrastructure resources and services to guarantee reliability, security, and scalability.

• Direct incident response efforts, including triage, root cause analysis, and post-incident evaluations.

• Contribute to and maintain incident response processes, documentation, and on-call practices.

• Collaborate with engineering teams to design resilient systems and decrease mean time to recovery (MTTR).

• Work alongside software engineering, QA, product, and IT teams to determine optimal solutions for complex infrastructure, security, and delivery issues.

• Mentor engineers in DevOps and platform practices, tools, and standards across the organization.

• Lead departmental initiatives related to DevOps, platform engineering, and infrastructure disciplines; present plans and progress to stakeholders.

• Propel new departmental initiatives based on organizational needs and your expertise in modern technologies and industry trends.

• Stay informed about emerging technologies, tools, and best practices; assess their potential integration within our tech stack.

⛳️ Requirements

• BS or MS in Computer Science, Engineering, or a related technical field, or equivalent practical experience.

• 6+ years of experience with container orchestration services (Kubernetes preferred).

• 6+ years of experience administering and deploying CI/CD tools (e.g., Git, Azure DevOps, Jira, GitLab, Jenkins).

• 6+ years of experience managing scalable applications in one or more major cloud environments.

• 8+ years of substantial experience with both Windows and Linux operating system environments.

• 7+ years of experience with scripting and automation using tools such as PowerShell, Bash, or Python.

• 4+ years of experience with infrastructure-as-code and orchestration platforms (e.g., Terraform, ARM/Bicep, CloudFormation, Ansible, etc.).

• Proven expertise in designing architectures for scalable, reliable, and secure tech stacks within distributed systems.

• Proven experience in implementing workflow processes for operating and maintaining applications in distributed architectures.

🏝️ Benefits

• Comprehensive health, dental, and vision insurance.

• Generous paid time off and flexible work arrangements.

• Opportunities for professional development and continuous learning.

• Collaborative and innovative work environment.

Senior Site Reliability Engineer

📋 Description

⛳️ Requirements

🏝️ Benefits

People also viewed

SRE Specialist

Azure DevOps Engineer, ML Ops Engineer

Backend Developer, PHP, React, DevOps

Lead DevOps Engineer, Data & AI Platform

DevOps Engineer, German

Site Reliability Engineer – Kubernetes Platform

Never miss a great job!