
Senior System Software Engineer - Dynamo-Triton Inference Server
Posted Jun 20

This is a fully remote position, open to applicants in California, +1 more state.
• Create top-tier GPU-accelerated AI inference serving software.
• Participate in feature development and promote widespread customer adoption.
• Lead the integration of the Triton Inference Server and NVIDIA Dynamo stacks to create a cohesive, high-performance inference platform.
• Ensure feature equivalency and efficiently support both Large Language Model (LLM) and non-LLM workloads.
• Develop resilient software suitable for deployment in production server or cloud settings.
• Optimize and balance prediction throughput alongside latency.
• Innovate and implement the next generation of inference technologies.
• MS or PhD in Computer Science or a related field (or equivalent experience).
• Over 5 years of professional experience in deep learning software development.
• Proficient in Rust and C++ programming languages.
• Knowledge of Python.
• Strong programming and software design abilities, including debugging, performance analysis, and test design.
• Experience with large-scale distributed systems and machine learning systems.
• Excellent communication skills and the ability to thrive in a fast-paced, agile team environment.
• Equity.
• Comprehensive benefits package.