
Principal Machine Learning Engineer
Posted Jun 20

Posted Jun 20
This is a fully remote position, open to applicants in United Kingdom.
• Develop and manage comprehensive ML pipelines encompassing data handling, training, evaluation, inference, and deployment.
• Refine and adapt models utilizing cutting-edge techniques such as LoRA, QLoRA, SFT, DPO, and distillation.
• Design and maintain scalable inference systems while balancing latency, cost, and reliability.
• Create and sustain data systems for high-quality synthetic and real-world training datasets.
• Implement evaluation pipelines to assess performance, robustness, safety, and bias, collaborating with research leadership.
• Oversee production deployment, focusing on GPU optimization, memory efficiency, latency reduction, and scaling strategies.
• Work closely with application engineering to seamlessly integrate ML systems into backend, mobile, and desktop applications.
• Make practical trade-offs and deliver enhancements swiftly, learning from actual usage.
• Operate within real production constraints: latency, cost, reliability, and safety.
• Solid foundation in deep learning and transformer-based architectures.
• Practical experience in training, fine-tuning, or deploying large-scale ML models in a production setting.
• Proficiency in at least one contemporary ML framework (e.g., PyTorch, JAX), with the capability to learn others swiftly.
• Familiarity with distributed training and inference frameworks (e.g., DeepSpeed, FSDP, Megatron, ZeRO, Ray).
• Strong software engineering principles – you create robust, maintainable, production-ready systems.
• Experience in GPU optimization, focusing on memory efficiency, quantization, and mixed precision.
• Comfort in owning ambiguous, zero-to-one ML systems from start to finish.
• A proactive approach to shipping, rapid learning, and enhancing systems through iteration.
• Applications are assessed by our technical team members.
• Interviews will be conducted through virtual meetings and/or onsite.
• We prioritize transparency and efficiency, so expect a swift decision.
Flock Safety
Inspiren
OneStudyTeam
CDW
Get handpicked remote jobs straight to your inbox weekly.