
Systems Engineer, HPC
Posted 56 min ago

Posted 56 min ago
This is a fully remote position, open to applicants in Canada.
• We are seeking Systems Engineers / System Administrators to assist in designing, operating, and scaling the infrastructure that supports Mistral’s AI platforms.
• Manage and maintain extensive Linux environments (bare metal, clusters, cloud).
• Oversee system health, resolve incidents, and guarantee high availability.
• Provide support for production and research workloads across various environments.
• Assist in scaling clusters to accommodate hundreds to thousands of nodes.
• Work on systems that handle petabyte-scale storage.
• Enhance performance, reliability, and resource utilization.
• Automate operational tasks using tools such as Python, Bash, Ansible, or Terraform.
• Refine deployment, provisioning, and system lifecycle management.
• Contribute to decisions regarding system design and architecture.
• Collaborate closely with HPC/infrastructure teams, Platform/DevOps engineers, and Research teams.
• Strong experience in Linux systems administration (core requirement).
• Experience in large-scale environments:
• HPC clusters or cloud infrastructure.
• Familiarity with job schedulers (e.g., Slurm).
• Excellent troubleshooting skills across systems, hardware, and networks.
• Knowledge of containers/orchestration (e.g., Kubernetes).
• Experience with storage systems (e.g., Ceph, Lustre, NFS).
• Understanding of networking fundamentals (Ethernet; InfiniBand is a plus).
• Proficiency in Infrastructure as Code / automation tools.
• Experience with GPU or AI/ML technologies.
• Impact: Play a crucial role in scaling Mistral’s state-of-the-art AI infrastructure.
• Growth: Opportunity to shape data center operations from the ground up in a rapidly growing startup environment.
• Collaboration: Work alongside a talented, cross-functional team that is passionate about AI and technology.
• Flexibility: Competitive compensation, benefits, and the opportunity to contribute to groundbreaking projects.
CVS Health
Baylor Genetics
Pure Storage
Get handpicked remote jobs straight to your inbox weekly.