Remotery

Principal Platform Infrastructure Engineer – SRE Enablement

Posted 1 day ago

This is a fully remote position, open to applicants in United Kingdom.

📋 Description

• Design, govern, and oversee the deployment and operation of large-scale, multi-region VM and Kubernetes infrastructure on GCP and AWS, ensuring optimal resilience and performance across all environments.

• Foster cross-functional technical collaboration with Engineering, Product, Compliance, and Security teams.

• Establish and uphold organizational best practices and standards for Infrastructure as Code (IaC) utilizing Terraform and Spacelift.

• Create and manage intricate configuration management and deployment workflows that enhance reliability and operational efficiency throughout the entire platform.

• Define the technical trajectory and implement comprehensive observability solutions (Grafana Cloud, Prometheus/Mimir, OTel collectors).

• Establish the strategic architecture and lifecycle management for core platform services.

• Proactively identify and spearhead large-scale strategic initiatives to reduce technical toil and boost operational efficiency.

• Mentor and provide technical advice to both junior and senior engineers.

• Participate in a 24x7 on-call rotation as part of a globally distributed team.


⛳️ Requirements

• Bachelor's degree in Computer Science, a related technical field, or equivalent hands-on experience.

• Proficient in popular programming and scripting languages such as Python, Bash, and Go.

• Understanding of network architectures, communication protocols (e.g., TCP/IP, HTTP/S, UDP, TLS), and enterprise-level connectivity solutions.

• Expertise in Kubernetes, including cluster administration, RBAC, networking, workload management, and troubleshooting in production environments.

• Demonstrated experience with Terraform for infrastructure provisioning and management.

• Familiarity with Google Cloud Platform services such as GKE, VPC networking, Cloud DNS, Artifact Registry, Secret Manager, IAM, Gemini Code Assist, and Workload Identity.

• Previous experience mentoring both junior and senior engineers successfully.

• Experience with GitOps methodologies and tools.

• Clear understanding of how to effectively utilize LLM code assist tools for software development.


🏝️ Benefits

• Our culture promotes collaboration, inclusivity, and enjoyment.

• We uphold five core values: Stay Aligned, Get It Done, Customer Empathy, Think Creatively, and Help Each Other Out.

• Opportunities to take initiative, implement new ideas, and make a lasting impact.

People also viewed

Investigo9 hours ago

Senior Cloud - Kubernetes SRE

GB flagUnited Kingdom OnlyFull-timeDevOps & Site Reliability Engineer (SRE)
ApplyView job
Software Mind9 hours ago

DevOps Engineer

AR flagArgentina OnlyFull-timeDevOps & Site Reliability Engineer (SRE)
ApplyView job
Cherokee Federal9 hours ago

DevSecOps Engineer

US flagUnited States OnlyFull-timeDevOps & Site Reliability Engineer (SRE)$125k – $140k/year
ApplyView job
Avaya9 hours ago

Site Reliability Engineer – Azure, DevSecOps, IaC, Governance, Observability

US flagUnited States OnlyFull-timeDevOps & Site Reliability Engineer (SRE)$129k – $143k/year
ApplyView job
Agilent Technologies9 hours ago

DevOps Engineer – Platform, AWS, CI/CD

US flagColorado OnlyFull-timeDevOps & Site Reliability Engineer (SRE)$143.8k – $224.6k/year
ApplyView job
Dropbox9 hours ago

Site Reliability Engineer

PL flagPoland OnlyFull-timeDevOps & Site Reliability Engineer (SRE)
ApplyView job

Never miss a great job!

Get handpicked remote jobs straight to your inbox weekly.

Trusted by 7,400+ designers