
Platform Engineer
Posted 23 hours ago

Posted 23 hours ago
This is a fully remote position, open to applicants in Minnesota.
• Develop, implement, and sustain infrastructure automation solutions for development, testing, and production environments utilizing Terraform and related Infrastructure as Code (IaC) tools.
• Architect and provision dependable cloud resources across AWS and Azure, encompassing compute, networking, and storage.
• Oversee and manage Kubernetes clusters and containerized applications using Docker and Helm charts.
• Establish and maintain networking essentials such as DNS, load balancing, and firewall settings.
• Assist with troubleshooting issues related to scalability, high availability, performance, backup, and restoration scenarios.
• Aid in monitoring cloud costs and identify optimization opportunities within AWS and Azure environments.
• Enhance and support Continuous Integration/Continuous Deployment (CI/CD) pipelines in Azure DevOps, including YAML pipelines, self-hosted agent pools, and service integrations.
• Implement and uphold GitOps workflows leveraging ArgoCD or Flux, maintaining desired state through version control.
• Create and sustain reusable pipeline templates and shared workflow libraries to prevent application teams from reinventing solutions for every service.
• Collaborate with engineering teams to plan and execute software releases and patches.
• Contribute to the development of self-service developer tools and internal platform capabilities that minimize friction and ticket-driven workflows for engineering teams.
• Help define and sustain “Golden Path” templates — standardized, pre-approved workflows for common activities like deploying new services or provisioning databases.
• Collect developer feedback and iterate on platform tools to significantly enhance engineering velocity.
• Produce and maintain clear documentation for platform functionalities and self-service processes.
• Support the complete software development lifecycle (SDLC) — from planning and coding through testing, review, deployment, and operations — by ensuring platform infrastructure is specifically designed to expedite each phase.
• Develop and maintain platform capabilities that support the AI-assisted development lifecycle (AIDLC), including infrastructure and runtime environments that allow autonomous coding tools to function securely within engineering workflows.
• Provision and manage compute environments, access controls, and sandboxed execution contexts for agentic development tools (e.g., Claude Code, Cursor, or similar), making certain they operate autonomously without compromising security or stability.
• Collaborate with engineering teams to establish guardrails, human-in-the-loop checkpoints, and audit trails for autonomous agent workflows — including automated build diagnosis, infrastructure remediation, and code generation pipelines.
• Facilitate the integration of AI agents into CI/CD pipelines, allowing agentic systems to engage in testing, code review, and deployment workflows with appropriate oversight controls.
• Assist in establishing platform standards for LLMOps concerns — including prompt versioning, agent output observability, and per-call cost attribution — as agentic systems transition into production use.
• Build and maintain logging, monitoring, and alerting solutions utilizing the ELK stack or Prometheus and Grafana.
• Ensure the availability, reliability, and performance of mission-critical hosted services.
• Implement and maintain policy enforcement tools (e.g., OPA Gatekeeper or Kyverno) to ensure that security and compliance guardrails are integrated into platform workflows.
• Support InfoSec scans, compliance audits, and SOC 2-related operational controls.
• Stay updated on emerging technologies and contribute innovative ideas to enhance platform capabilities.
• Proven experience in building and maintaining cloud infrastructure on AWS and/or Azure, including compute, networking, and storage.
• Hands-on experience with Kubernetes and Docker in production settings.
• Expertise in infrastructure automation with Terraform or similar IaC tools.
• Experience in supporting and enhancing CI/CD pipelines in Azure DevOps, including YAML pipelines and agent pools.
• Familiarity with GitOps workflows using ArgoCD or Flux.
• Working knowledge of networking principles including DNS, load balancing, and firewall configurations.
• Proficient in at least one scripting language (Python, Bash, PowerShell, or similar).
• Familiarity with logging, monitoring, and alerting tools such as the ELK stack or Prometheus and Grafana.
• Experience with Helm charts and Git version control best practices.
• Knowledge of database technologies such as PostgreSQL or MS SQL Server.
• Experience supporting Linux and/or Windows server environments.
• Familiarity with Application Performance Monitoring (APM) tools such as Elastic APM or New Relic.
• Strong written and verbal communication skills with the ability to document platform capabilities clearly for engineering audiences.
• Capability to manage multiple priorities in a dynamic environment, including collaboration with onshore and offshore teams.
• Experience working within Agile or Scrum methodologies.
• Bachelor’s degree in Computer Science, Engineering, or a related field is required; equivalent practical experience will be considered.
• 2 to 4 years of experience in a DevOps, platform engineering, or cloud infrastructure role.
• 2 or more years of practical experience with AWS and/or Azure.
• Workplace Flexibility
• 401(k) Plan: Deferred and Roth options available to help you save for retirement, with a generous employer match of 50% up to a maximum of 4.5% of gross pay.
• Medical Plans: Comprehensive co-pay or HSA coverage options to keep you and your family healthy.
• Dental and Vision Plans: Access to a large network of providers for dental and vision health.
• Daycare and Medical FSA/HSA: Save on eligible daycare and healthcare expenses with our flexible spending and health savings account plans.
• Group Term Life Insurance: Coverage of $50,000 to provide peace of mind.
• Generous Paid Time Off (PTO) Policies: Ample time to relax and recharge.
• Employee Assistance Program (EAP): Support for personal and professional challenges.
• Voluntary Benefits Available: Additional Life Insurance, Critical Illness Insurance, Accident, Cancer & Hospital Indemnity Insurance, Legal/ID Shield, Pet Insurance.
• Parental Leave: Four weeks of paid paternity leave after one year of employment; twelve weeks of paid leave for the birth parent, eligibility rules apply.
Tango
Accenture Federal Services
Strategize it Inc.
Accela
Get handpicked remote jobs straight to your inbox weekly.