
Infrastructure Engineer
Posted 1 hour ago

Posted 1 hour ago
This is a fully remote position, open to applicants in New York.
• Assist application teams by efficiently handling infrastructure requests (permissions, roles, service setup, project peering) to allow product engineers to concentrate on development.
• Take ownership of CI/CD processes and deployments: enhance and maintain our GitHub Actions workflows, and facilitate the transition to a dedicated CD tool with appropriate permissions — the objective is fully automated, secure deployments via service accounts, eliminating direct engineer access to production.
• Develop and sustain infrastructure as code: create and modify Terraform modules for both new and existing services across GCP environments.
• Implement Kubernetes effectively: oversee service deployments using Helm (currently on Helm 4) and ensure the health of asynchronous workloads on Dagster.
• Streamline observability (likely the initial project): unify the current per-team alerting into a singular view — system-to-system dashboards and incident alerting that directs upstream service/vendor failures to the appropriate impacted teams and on-call rotations.
• Enhance resilience: assist in transitioning to a completely region- and cloud-agnostic architecture, allowing services to relocate seamlessly in the event of a failure.
• Fortify security and access: enforce IAM policies, manage secrets, uphold least privilege principles, and ensure auditability; contribute towards SOC 2 compliance.
• Leverage AI for automation: develop agent skills / agents.md so that routine tasks (such as provisioning access and simple modifications) can be performed by an agent rather than requiring human engineering time, and utilize AI to analyze more complex issues.
• Strong software engineering principles in at least one production language (Python, Go, TypeScript, or Rust); proficiency in Python is particularly valued, along with comfort in scripting and shell usage.
• Practical experience with cloud infrastructure and essential cloud services, particularly GCP (AWS/Azure experience is transferable).
• Familiarity with operating large-scale Kubernetes production environments.
• Experience with Infrastructure as Code, specifically using Terraform.
• Knowledge of CI/CD systems, especially GitHub Actions or Octopus Deploy.
• Capability to debug production issues utilizing logs, metrics, traces, shell tools, and source code.
• Fundamentals of security and access control: IAM, secrets management, least privilege, and auditability principles.
• Proficient written communication regarding incidents, design decisions, and operational procedures.
• Competitive salary and performance-based bonuses.
• Comprehensive health, dental, and vision insurance.
• Flexible work hours and remote work opportunities.
• Professional development and continuous learning support.
• Generous paid time off and holiday policies.
MDaudit
Intersect Power
boxxe
Feldera
Get handpicked remote jobs straight to your inbox weekly.