
Principal Platform Engineer
Posted 22 hours ago

• Design, implement, and oversee elastically scaling cloud infrastructure on GCP, using container orchestration tools such as Kubernetes.
• Create automated pipelines for the training, testing, and deployment of machine learning models utilizing tools like Jenkins, GitHub Actions, or Airflow.
• Deploy observability tools to monitor model drift, accuracy, latency, and performance decline in a production environment.
• Foster collaboration among data engineers, ML engineers, and backend and frontend engineers.
• Establish thorough monitoring for system health and ML-specific metrics to ensure long-term model reliability.
• Introduce tools that enable individual teams to oversee their workloads effectively.
• Engage in on-call rotation and manage compliance with standards such as SOC.
• 8 - 10+ years of experience in DevOps/Platform Engineering, with a minimum of 2 years dedicated to the operation and maintenance of production ML workloads.
• Extensive hands-on experience with GCP (including VPC-SC, IAM, Organization Policies) and GKE (covering cluster topology, Helm, Kustomize).
• Strong proficiency with Istio and API Gateways (such as Kong).
• Advanced Terraform expertise, utilizing an Atlantis/GitOps workflow.
• Experience in managing enterprise-level identity and secrets (Auth0, Dex, ESO, or SOPS).
• Proven experience operating Airflow in a production setting and managing an ML-serving stack (Triton, vLLM, MLflow).
• Comfortable overseeing Cloud SQL (PostgreSQL), BigQuery, Elasticsearch, or ClickHouse.
• At least upper-intermediate proficiency in spoken and written English.
• Address genuine customer challenges.
• Witness your impact firsthand.
• Propel your career forward.
• Gain opportunities to learn new technologies, products, and markets within a dynamic, growth-focused environment.
• Collaborate with other skilled professionals at a company that values its people.