
Platform Engineer
Posted May 21

Posted May 21
This is a fully remote position, open to applicants in Brazil.
• Design, implement, and enhance Infrastructure as Code (Terraform, Flux, Helm) for provisioning and managing Kubernetes clusters across AWS, GCP, Azure, and future cloud providers.
• Develop and maintain self-service capabilities — automated provisioning pipelines, environment management, and deployment workflows — enabling engineering teams to autonomously manage their cloud resources without platform team intervention.
• Architect and manage GitOps-driven CI/CD pipelines, creating Golden Paths and standardized templates that elevate the quality baseline for all product teams.
• Drive the DevSecOps strategy for the platform, integrating compliance policies (SOC2, GDPR, customer audits) directly into automation pipelines and Kubernetes admission controls.
• Define and track SLIs/SLOs for platform-owned systems; lead incident response during significant outages and conduct blameless post-mortem analyses and subsequent actions.
• Oversee and enhance developer tooling platforms like Backstage, GitLab, Artifactory, and SonarQube, consistently improving the developer experience based on team input.
• Establish the observability framework and standards (metrics, tracing, log aggregation) across various teams, ensuring platform reliability through proactive system health monitoring.
• Create Kubernetes operators and other cloud-native automation to minimize operational toil and boost platform resilience.
• Execute disaster recovery planning, testing, and runbook documentation; design and maintain Production Readiness Reviews and operational procedures.
• Serve as a Platform Engineering Champion for domain teams during refinements and technical spikes, providing dedicated sprint capacity for platform-related projects.
• Monitor and evaluate infrastructure costs across cloud platforms; develop tools for proactive capacity planning.
• Minimum of 3 years of experience in Platform Engineering, DevOps, Cloud Engineering, or Site Reliability Engineering roles.
• In-depth knowledge of Kubernetes and its ecosystem (cluster lifecycle, node pools, networking, RBAC, resource optimization, operators) across at least two major cloud providers — this is a non-negotiable requirement.
• Practical experience with Infrastructure as Code tools, preferably Terraform, Flux, and Helm.
• Strong fundamentals in Linux systems administration and networking (load balancers, DNS, VPNs, firewalls, TCP/IP).
• Solid understanding of information security principles with hands-on experience applying DevSecOps practices to cloud infrastructure, including Policy-as-Code and compliance requirements (SOC2, GDPR).
• Experience in designing and managing CI/CD pipelines and GitOps workflows (GitLab CI, FluxCD, ArgoCD, or similar).
• Comprehensive observability experience with modern tooling (OpenTelemetry, Prometheus, or similar).
• Strong understanding of SRE practices: SLI/SLO definitions, error budgets, and blameless postmortems.
• Proven experience in driving incident response processes, post-incident reviews, and operational follow-up actions.
• Demonstrated capability to create and maintain production readiness documentation, runbooks, and disaster recovery procedures.
• Familiarity with Chaos Engineering principles and proactive reliability practices.
• Proven ability to develop self-service platform capabilities that reduce developer toil and eliminate ticket-driven workflows.
• Experience in managing and evolving developer tooling platforms (Backstage, GitLab, Artifactory, SonarQube, or similar).
• Customer-oriented mindset — ability to translate developer pain points into practical, adoptable platform solutions.
• Excellent communication skills in both English and Portuguese (written and verbal), with a proven ability to collaborate across cross-functional, remote-first teams.
• Healthcare
• Dental care
• R$ 1.400,00/month on Caju card (for food and meal allowance, mobility, home office supplies, culture, health, and education)
• Life insurance
• Child care assistance
• Wellhub (Gympass)
• English course: partnership for group classes for R$100 monthly
• Global Equity Program
Attio
TechBiz Global
Get handpicked remote jobs straight to your inbox weekly.