
Senior ML Solutions Architect – Token Factory
Posted May 22

Posted May 22
This is a fully remote position, open to applicants in Singapore.
• Design and develop LLM-based solutions utilizing Nebius Token Factory’s inference services to enhance business value and fulfill customer objectives.
• Create production-ready applications by leveraging our serverless LLM APIs, including multimodal models (text, vision, audio) and specialized domain models.
• Offer technical expertise in prompt engineering, RAG architectures, model selection, and optimization of inference.
• Work in conjunction with product and engineering teams to gather customer feedback and influence the platform roadmap.
• Assist customers in transitioning from proof of concept to production, focusing on performance, reliability, and cost-effectiveness.
• Over 5 years of experience in ML/AI systems, with a minimum of 2 years dedicated to LLMs and generative AI.
• Extensive knowledge of the LLM ecosystem, encompassing model architectures and fine-tuning methodologies.
• Practical experience with:
• Prompt engineering and LLM pipeline development, including evaluation.
• Agentic frameworks such as Langchain, Langsmith, smolagents, or comparable alternatives.
• Vector databases and RAG implementation strategies.
• Deploying LLM-based applications using APIs from OpenAI, Anthropic, or open-source models.
• Proficient in Python programming.
• Strong communication skills, capable of clearly articulating technical concepts to varied audiences.
• Competitive salary along with a comprehensive benefits package.
• Opportunities for professional development within Nebius.
• Flexible working arrangements.
• A vibrant and collaborative work environment that encourages initiative and innovation.
NVIDIA
Towa Software
AIM Qualifications and Assessment Group
Get handpicked remote jobs straight to your inbox weekly.