
Principal Systems Engineer – HPC/AI System Administrator, Multi-discipline Expert
Posted May 9

Posted May 9
This is a fully remote position, open to applicants in United States.
• Oversee and manage daily operations of HPC systems, ensuring reliable scheduling and system software functionality.
• Work collaboratively with GDIT HPC engineers, system administrators, developers, and NWS operational personnel.
• Implement enhancements to system performance, scheduler dependability, and operational durability.
• Employ expertise in Linux system administration, HPC scheduling, scripting languages, and performance monitoring tools.
• A minimum of 15 years of relevant experience.
• Must be a U.S. Citizen.
• Bachelor’s degree in Arts or Science.
• Proficient in Linux system administration, preferably with Rocky or SLES.
• Familiarity with HPC batch schedulers such as PBS Pro, Slurm, or equivalent.
• Skilled in scripting languages including Bash, Python, and Perl.
• Comprehension of HPC architectures, distributed computing, and MPI-based workloads.
• Strong troubleshooting capabilities in multi-node HPC environments.
• Health insurance.
• 401(k) plan with company matching.
• Comprehensive benefits and wellness packages.
• Paid time off including vacation, sick leave, and personal days.
• 15 days of paid leave plus 10 paid holidays annually.
• Paid parental leave, military leave, bereavement leave, and jury duty leave.
• Short-term and long-term disability benefits.
• Life and accident insurance.
The Growth Partner
Highmark Health
Mitratech
Get handpicked remote jobs straight to your inbox weekly.