Remotery

SRE Specialist II – Tech Lead

Posted May 20

This is a fully remote position, open to applicants in Brazil.

📋 Description

• Ensure the availability, reliability, and performance of production systems by monitoring runtime behavior and participating in global on-call rotations in accordance with SRE best practices.

• Design, develop, and maintain automation tools, internal libraries, and infrastructure abstractions to enhance resilience and minimize manual operational tasks.

• Lead configuration, testing, security, and deployment processes using Infrastructure as Code, CI/CD pipelines, and cloud-native technologies to ensure consistent and secure deployments.

• Advocate for SDLC standards, DevOps practices, and operational excellence while collaborating with engineering teams across various regions.

• Collaborate with software engineers, project managers, and cross-functional partners in Brazil and globally to support system design, deployments, and ongoing operations.

• Mentor team members and participate in technical interviews to enhance technical skills and operational maturity.

• Continuously evaluate cloud infrastructure to identify performance bottlenecks, reliability risks, and optimization opportunities to enhance scalability and manage costs.


⛳️ Requirements

• Bachelor’s degree in Computer Science, Engineering, or a related technical field (or equivalent experience).

• Strong background in Site Reliability Engineering and DevOps practices within cloud-based environments.

• Familiarity with Google Cloud Platform (GCP) and Amazon Web Services (AWS).

• Proficient in Infrastructure as Code using Terraform and Atlantis, including the development of reusable modules.

• Experience with CI/CD tools such as GitHub Actions, Harness, Jenkins, and Nexus.

• Expertise in containerization and orchestration technologies like Kubernetes, Helm, and GKE.

• Experience with workflow orchestration tools such as Airflow or Cloud Composer.

• Knowledge of data and messaging platforms including BigQuery, CloudSQL, and Pub/Sub.

• Familiarity with monitoring and observability tools like Stackdriver/Cloud Logging and Looker.

• Programming/scripting skills in Python, Golang, Scala, or Shell.

• Strong understanding of cloud architecture fundamentals, including networking, compute, storage, and messaging.

• Experience working with globally distributed systems and teams.

• Excellent troubleshooting, automation, and systems-thinking skills.

• Demonstrated technical leadership and mentoring abilities.

• Proficient in English and willing to travel to São Carlos/SP as needed.


🏝️ Benefits

• Support for diversity initiatives.

• Affinity groups that empower and support people from underrepresented groups: ExperianPride (LGBTQIAPN+ community), Ubuntu (racial equity), Women in Experian (gender equity), Aspire (people with disabilities), and Connecting Generations (generational diversity).
