
Senior Infrastructure Engineer – On-Prem
Posted May 30

This is a fully remote position, open to applicants in India.
• Design, implement, and manage enterprise-level platform services across both on-premises and hybrid cloud settings.
• Develop and sustain Infrastructure as Code pipelines utilizing Terraform, ArgoCD, and GitHub Actions—instilling GitOps principles into each deployment.
• Operate and scale Kubernetes clusters at an advanced level: not merely deploying workloads, but also overseeing cluster administration, networking, and security.
• Establish and oversee Big Data infrastructure on-premises (Hadoop, EMR-equivalent), providing the same analytical capabilities that enterprises expect from cloud solutions.
• Architect and manage distributed storage systems (MinIO, S3-compatible) and distributed caching layers (Redis, Memcached) within production environments.
• Create and maintain a comprehensive observability stack—OpenTelemetry, Prometheus, Grafana, and Loki—ensuring the team has the insight needed to operate confidently.
• Proven production experience delivering cloud-like capabilities (EMR, S3, Lambda, and SQS equivalents) in on-premises environments.
• Practical experience establishing Big Data infrastructure on-premises (e.g., Hadoop or an EMR equivalent on bare metal).
• Expertise in Infrastructure as Code: Terraform, ArgoCD, and GitHub Actions.
• Strong skills in GitOps and pipeline automation—you advocate for repeatable, auditable deployments.
• Advanced Kubernetes capabilities, including cluster administration, beyond just application deployment.
• Production experience with distributed storage infrastructure (MinIO / S3-compatible).
• Production experience with distributed caching infrastructure (Redis, Memcached).
• Solid foundation in observability tools: OpenTelemetry, Prometheus + Grafana, and Loki.
• Experience managing local artifact registries (Nexus or similar).
• Proficiency in hypervisor and operating system-level troubleshooting and performance analysis.
• Backup and recovery strategies for distributed data systems.
• Capacity analysis and scalability forecasting—you plan proactively, not just reactively.
• Flexible work environment
• Direct access to leadership
• A clear path for rapid progression in your role
• Competitive salary with performance bonuses