Remotery

AI Infrastructure & Platform Operations Engineer

Posted 6 days ago

This is a fully remote position, open to applicants in Poland.

📋 Description

• Oversee, manage, and provide support for production AI infrastructure platforms.

• Identify and resolve incidents related to infrastructure, networking, hardware, and platforms.

• Provide support for NVIDIA GPU infrastructure and related platform services.

• Monitor and troubleshoot environments based on Kubernetes.

• Examine performance, availability, and reliability concerns across infrastructure and platform components.

• Work collaboratively with engineering teams, hardware vendors, datacenter staff, and service delivery teams to address technical challenges.

• Engage in incident response, root cause analysis, and initiatives aimed at operational improvements.

• Contribute to enhancements in monitoring, observability, automation, and operational procedures.

• Keep operational documentation, runbooks, and knowledge articles up to date.


⛳️ Requirements

• A minimum of 3 years of experience in infrastructure operations, platform operations, network operations, site reliability engineering, cloud operations, datacenter operations, or similar technical roles.

• Proficient in Linux administration and troubleshooting.

• Solid understanding of networking principles and experience in diagnosing infrastructure-related problems.

• Familiarity with Kubernetes in production settings.

• Experience in supporting production infrastructure and services.

• Strong analytical and problem-solving abilities.

• Experience adhering to structured operational and incident management processes.

• Exceptional communication and collaboration abilities.

• Willingness to work within a shift-based operational framework.


🏝️ Benefits

• Work with some of the most cutting-edge AI infrastructure environments currently in production.

• Gain exposure to NVIDIA GPU technologies, Kubernetes platforms, and high-performance networking environments.

• Contribute to defining the operational and support framework for next-generation AI infrastructure.

• Be part of a team that is shaping the future of AI-powered operations through k0rdent AI.

• Join an expanding organization that is heavily investing in AI infrastructure and platform services.

People also viewed

Attio2 days ago

Senior Platform Engineer

PL flagPoland OnlyFull-timePlatform Engineer€95k – €125k/year
ApplyView job
Devoteam3 days ago

AWS Platform Engineer

PT flagPortugal OnlyFull-timePlatform Engineer
ApplyView job
TechBiz Global6 days ago

Platform Engineer

CH flagSwitzerland OnlyFull-timePlatform Engineer
ApplyView job
TM Forum6 days ago

Junior Platform Engineer

IN flagIndia OnlyFull-timePlatform Engineer
ApplyView job
TIMOCOM6 days ago

Platform Engineer

DE flagGermany OnlyFull-timePlatform Engineer
ApplyView job
Vira Games6 days ago

Senior Platform Engineer

EuropeFull-timePlatform Engineer
ApplyView job

Never miss a great job!

Get handpicked remote jobs straight to your inbox weekly.

Trusted by 7,400+ designers