
Staff Platform Engineer
Posted 1 day ago

Posted 1 day ago
This is a fully remote position, open to applicants in California, +2 more states.
• Define, design, and establish standards for composable Infrastructure as Code (IaC) patterns using CDK or Terraform for Cloud Infrastructure (EKS).
• Promote the adoption and implementation of composable, idempotent, multi-environment GitOps workflows.
• Enhance scalability and optimize cost-per-performance through metrics-driven automation and autoscaling technologies such as Karpenter.
• Create and sustain observability across the platform utilizing Prometheus, Grafana, and distributed tracing techniques.
• Lead collaborative initiatives with application teams to establish Service Level Objectives (SLOs) and capacity models for essential services.
• Oversee major production incident responses, facilitate blameless postmortems, and mentor fellow engineers on incident response, postmortems, and reliability evaluations.
• Design and manage resilient CI/CD pipelines to ensure safe and rapid deployments and rollbacks.
• Advocate for automation, minimal toil operations, and foster a culture of continuous improvement.
• Develop agentic workflows that leverage the team-wide context layer and operational data to expedite development while maintaining reliability.
• Serve as a technical leader and mentor to uplift the team, demonstrating strong listening skills and the ability to evaluate and provide constructive feedback on ideas.
• Proven expertise in architecting, deploying, and maintaining high-throughput, low-latency distributed systems within cloud production environments, ideally in Platform, SRE, or DevOps roles. Previous ownership of a stateful deployment is essential.
• In-depth skills in systems-level coding for automation and systems development, particularly in Go, Python, or TypeScript.
• Demonstrated experience operating Kubernetes at scale (EKS preferred) and implementing IaC patterns (CDK, Terraform).
• Familiarity with GitOps and reconciliation loops in Kubernetes controllers.
• Strong experience with CI/CD systems, including GitHub Actions and AWS CodePipeline.
• Expertise in defining, designing, and optimizing global monitoring and alerting pipelines, including PromQL, metrics correlation, and alert noise reduction.
• Experience with large-scale streaming or ad-serving workloads, including HTTP-based delivery (oRTB, VAST), event streaming (Kafka), and AWS network architecture (VPC, load balancers, peering).
• Understanding of cloud security best practices, including IAM, encryption, network segmentation, and zero trust principles.
• Proven capability to conduct comprehensive performance analysis, tuning, and optimization across the entire infrastructure stack to meet cost-per-performance and latency objectives.
• Strong Medical, Dental and Vision Benefits, fully covered by Wurl.
• Remote-first work policy.
• Flexible Time Off.
• 10 US Holidays.
• 401(k) Matching.
• Pre-Tax Savings Plans, Health Savings Account (HSA) & Flexible Spending Account (FSA).
• Subscriptions to Ginger, Aaptiv, and Headspace for mental and physical wellness.
• OneMedical subscription for 24/7 convenient medical care.
• Paid Maternity and Parental Leave for all family additions.
• Discounted PetPlan and easy at-home access to Covid testing through empowerDX.
• $1,000 Work From Home Stipend to establish your office.
Cision France
Navigate Power
Get handpicked remote jobs straight to your inbox weekly.