Remotery

Principal Platform Infrastructure Engineer – SRE Enablement

Posted 1 day ago

📋 Description

• Design, govern, and oversee the deployment and operation of high-scale, multi-region VM and Kubernetes infrastructures on GCP and AWS, ensuring optimal resilience and performance across all environments.

• Facilitate technical alignment across functions with Engineering, Product, Compliance, and Security teams.

• Establish and uphold organizational best practices and standards for Infrastructure as Code (IaC) utilizing Terraform and Spacelift.

• Create and manage intricate configuration management and deployment workflows that enhance reliability and operational efficiency across the platform.

• Determine the technical direction and implement extensive observability solutions (Grafana Cloud, Prometheus/Mimir, OTel collectors).

• Define the strategic architecture and lifecycle management of core platform services.

• Proactively identify and spearhead large-scale strategic initiatives to reduce technical toil and enhance operational efficiency.

• Mentor and provide technical support to both junior and senior engineers.

• Engage in a 24x7 on-call rotation as part of a globally distributed team.


⛳️ Requirements

• Bachelor's degree in Computer Science, a related technical field, or equivalent practical experience.

• Proficiency in commonly used programming and scripting languages such as Python, Bash, and Go.

• Familiarity with network topologies, communication protocols (e.g., TCP/IP, HTTP/S, UDP, TLS), and enterprise-grade connectivity solutions.

• Expertise in Kubernetes, including cluster administration, RBAC, networking, workload management, and troubleshooting in production environments.

• Demonstrated experience with Terraform for infrastructure provisioning and management.

• Knowledge of Google Cloud Platform services, including GKE, VPC networking, Cloud DNS, Artifact Registry, Secret Manager, IAM, Gemini Code Assist, and Workload Identity.

• Previous experience and success in mentoring junior and senior engineers.

• Familiarity with GitOps methodologies and tools.

• Clear understanding of how to effectively utilize LLM code assist tools to build software.


🏝️ Benefits

• Our culture is collaborative, inclusive, and enjoyable.

• Five core values: Stay Aligned, Get It Done, Customer Empathy, Think Creatively, and Help Each Other Out.

• Opportunities to take initiative, implement new ideas, and create a lasting impact.

People also viewed

Arctiq18 hours ago

Site Reliability Engineer

US flagVirginia OnlyFreelanceDevOps & Site Reliability Engineer (SRE)
ApplyView job
Arctiq18 hours ago

Senior Site Reliability Engineer

US flagVirginia OnlyFreelanceDevOps & Site Reliability Engineer (SRE)
ApplyView job
Software Mind18 hours ago

Senior DevOps Manager, German speaking

PL flagPoland OnlyFull-timeDevOps & Site Reliability Engineer (SRE)
ApplyView job
Mediastream18 hours ago

DevOps Engineer

RO flagRomania OnlyFull-timeDevOps & Site Reliability Engineer (SRE)
ApplyView job
Kyndryl18 hours ago

Site Reliability Engineer

US flagOhio OnlyFull-timeDevOps & Site Reliability Engineer (SRE)$161.5k – $290.8k/year
ApplyView job
Guidehouse18 hours ago

Senior Azure DevOps Engineer

US flagUnited States OnlyFull-timeDevOps & Site Reliability Engineer (SRE)$118k – $196k/year
ApplyView job

Never miss a great job!

Get handpicked remote jobs straight to your inbox weekly.

Trusted by 7,400+ designers