
Member of Engineering, Evaluations
Posted May 20

This is a fully remote position, open to applicants in Europe.
• Research and implement evaluations and benchmarks for both foundation models and instruction-following models.
• Partner with applied research and product teams to establish meaningful metrics and evaluations that reflect our progress in real-world software development capabilities.
• Collaborate within a team environment: plan upcoming initiatives, engage in discussions, and communicate effectively with colleagues.
• Hands-on experience with large language models (LLMs).
• Deep understanding and intuition regarding LLMs and their inherent limitations.
• An appreciation for quality and a curious approach to learning.
• Robust engineering background.
• Strong programming abilities, preferably across various languages.
• Familiarity with the complete software development life cycle.
• Practical programming experience.
• Proficiency in Linux.
• Strong skills in algorithms.
• Knowledge of multiple programming languages, including Python.
• Use of modern tools, with a continuous-improvement mindset.
• Excellent critical thinking skills and the capacity to challenge code quality policies when necessary.
• Fully remote work with flexible hours.
• 37 days per year of vacation and holidays.
• Health insurance coverage for you and your dependents.
• Equipment provided by the company.
• Allowances for wellbeing, ongoing learning, and home office setup.
• Regular team gatherings.
• A vibrant, diverse, and inclusive people-first culture.