
Solutions Engineer – Media
Posted May 20

This is a fully remote position, open to applicants in Brazil.
• Take ownership of data quality and manage the curation of media datasets.
• Collaborate with Sales and Solutions teams to convert customer needs into effective curation strategies.
• Handle imperfect partner data, which may include mismatched metadata, schema variations, and incomplete labeling.
• Normalize and standardize datasets to ensure reliable use in downstream applications.
• Utilize SQL, internal APIs, and metadata tools to query and analyze Protege’s media catalog for relevant content.
• Develop validation checks and workflows to guarantee dataset integrity prior to delivery.
• Detect, troubleshoot, and rectify data quality issues across file structures, metadata, and content alignment.
• Leverage AI tools and transcoded embeddings to enhance and refine clip-level content.
• Transform unstructured, real-world data into structured datasets that align with customer and model requirements.
• Conduct iterative sample reviews with customers, incorporate their feedback, refine selections, and ensure final packages adhere to specifications.
• Acquire in-depth knowledge of Protege’s media catalog structure, metadata, and growth trends.
• Monitor content coverage, diversity, and modality mix, identifying gaps in relation to customer demand.
• Collaborate with Product and Partnerships teams to share catalog insights that shape sourcing priorities.
• Work across departments to ensure that content packaging complies with technical, ethical, and licensing standards.
• Create methods, scripts, and internal tools that enhance curation efficiency and scalability.
• Contribute to the development of Protege’s delivery platform, including how internal users and customers search, sample, and export data.
• Work closely with embedding-based systems, iterating between algorithmic selection and human review.
• Establish best practices for embedding queries, relevance assessment, and content diversity.
• Uphold a high standard of operational excellence and quality assurance throughout all processes.
• 4-7 years of experience in data science, media analytics, technical curation, or related hands-on data roles.
• Proficient in SQL with the ability to query large, complex datasets to derive insights and drive actions.
• Experience with media metadata, embeddings, or unstructured content.
• Skill in translating nuanced customer or model specifications into clear dataset requirements.
• High standards for data quality, operational rigor, and the usability of delivered outputs.
• Effective communicator capable of bridging technical detail and customer-friendly clarity.
• Ability to thrive in ambiguous, fast-paced environments while treating colleagues with respect and kindness.
• Health insurance
• Professional development opportunities
• Flexible work arrangements
• Remote work options