
Platform Infrastructure Engineer – SRE Core
Posted 11 hours ago

This is a fully remote position, open to applicants in the United Kingdom.
• Design, implement, and oversee VM and Kubernetes infrastructure on GCP and AWS across numerous clusters encompassing development, staging, and production environments in various regions.
• Collaborate with colleagues on your own team and across other teams so that your work addresses the problems we are trying to solve.
• Develop and manage Infrastructure as Code (IaC) using Terraform modules, managing resources through Spacelift or similar Terraform Automation and Collaboration Software (TACOS). Provision cloud infrastructure, including networking, compute, storage, and security components, primarily on GCP, with additional AWS support.
• Execute and manage workflows with advanced multi-layer configuration management.
• Create and sustain comprehensive observability solutions utilizing Grafana Cloud, Prometheus/Mimir, and OTel collectors. Design Grafana dashboards, configure alerting rules, and ensure visibility across all platform components.
• Oversee certificate lifecycle, DNS automation, ingress controllers, and service mesh networking with Cilium.
• Collaborate with Engineering, Product, Compliance, and Security teams to design robust, scalable systems. Provide consultation on capacity planning, disaster recovery, and architectural choices for cloud-native applications.
• Identify and mitigate toil through automation. Write scripts, develop tools, and construct CI/CD pipelines to enhance operational efficiency and minimize manual tasks.
• Engage in a 24x7 on-call rotation as part of a globally distributed team, responding to incidents and leading post-incident reviews.
• Bachelor's degree in Computer Science, a related technical field, or equivalent practical experience.
• Proficiency in common programming and scripting languages, with a significant focus on Python, Bash, and Go.
• Familiarity with network topologies, communication protocols (e.g., TCP/IP, HTTP/S, UDP, TLS), and enterprise-grade connectivity solutions.
• Expertise in Kubernetes, including cluster administration, RBAC, networking, workload management, and troubleshooting in production environments.
• Demonstrated experience with Terraform for infrastructure provisioning and management.
• Knowledge of Google Cloud Platform services such as GKE, VPC networking, Cloud DNS, Artifact Registry, Secret Manager, IAM, Gemini Code Assist, and Workload Identity.
• Experience with GitOps methodologies and tools.
• Collaborative, inclusive, and enjoyable culture.
• Opportunities to take initiative.
• Support for innovative ideas.
• Open communication.