
DevOps Engineer – HPC & GPU Platform
Posted May 20

This is a fully remote position, open to applicants in France.
• Build GPU benchmarking frameworks on AWS that orchestrate benchmark runs, collect and store results, and enable performance comparisons across versions.
• Develop tools for correctness validation: automating the testing of numerical accuracy of GPU compute outputs against established reference results.
• Implement distributed observability across all platform services, using structured logging, distributed tracing (Pulsar), and performance metrics.
• Collaborate on wider HPC coding projects alongside the engineering team.
• Proficient Python or Go developer — capable of writing real application code, rather than just scripts.
• Familiarity with observability tools (Prometheus, Grafana, distributed tracing).
• Comfortable working with AWS (EC2, IAM, VPC) and CI/CD pipelines.
• Experience in HPC or GPU environments is highly advantageous — familiarity with Slurm, compute clusters, and GPU workloads.
• Preferred educational background includes ENSIMAG, Centrale, INSA, X, or equivalent engineering qualifications.
• Fluent in English — the team operates across France and the UK.
• Fully remote work, with 1 day per week in London.