
Senior Data Scientist, LLM
Posted May 24

This is a fully remote position, open to applicants in Brazil.
• Create, refine, and assess Visual Language Models (VLMs) to improve document comprehension, with an emphasis on multimodal data such as text, images, and technical illustrations.
• Develop and execute data preparation, cleaning, and augmentation strategies specifically designed for multimodal model training, ensuring high-quality data pipelines for VLMs.
• Utilize transfer learning and pre-trained models to expedite model development and enhance performance on Xometry’s unique datasets.
• Employ cloud resources (e.g., Amazon Web Services) to efficiently scale the training and fine-tuning processes for VLMs.
• Work in collaboration with data engineering and machine learning operations (MLOps) teams to implement VLMs in production and track their performance.
• Analyze model outputs and enhance model accuracy and robustness by employing data analysis and visualization tools (such as Python, Jupyter Notebooks, and SQL).
• Test and apply cutting-edge model architectures, consistently optimizing VLM performance in a dynamic, iterative setting.
• Engage in a team-oriented environment, participating in peer evaluations, sharing insights, and contributing to a culture of continuous learning and advancement.
• A bachelor’s degree is mandatory; an advanced degree (M.S. or PhD) in computer science, data science, machine learning, or a related discipline is strongly preferred.
• Over 5 years of experience in data science and machine learning, with a specialization in Visual Language Models or multimodal machine learning.
• Extensive experience with machine learning libraries and frameworks such as PyTorch, TensorFlow, or Hugging Face.
• Proficient in Python, including libraries like pandas, NumPy, and scikit-learn.
• Comprehensive understanding of deep learning methodologies and experience with transfer learning, fine-tuning, and model assessment.
• Familiarity with cloud platforms (e.g., AWS SageMaker) for model training and deployment.
• Knowledge of data processing and visualization tools (SQL, Jupyter Notebooks, Looker, etc.) and fundamental database concepts, with exposure to database systems (e.g., Snowflake, MongoDB).
• Strong analytical and problem-solving capabilities, with a proven ability to thrive in an environment that encourages teamwork, innovation, and ongoing learning.
• Experience with computer vision tasks and frameworks, along with exposure to multimodal data, is a plus.
Xometry is an equal opportunity employer. All applicants will be considered for employment without regard to race, color, religion, sex, sexual orientation, gender identity