Remotery

Technical Staff Member – Data Intelligence

Posted 1 day ago

This is a fully remote position, open to applicants in the United States.

📋 Description

• Collaborate with model researchers to establish the definition of “good data” for our models, encompassing quality metrics, validation checks, and acceptance thresholds.

• Investigate open-source datasets and develop internal datasets that are most appropriate for constructing fundamental World Models.

• Create algorithms for the automated assessment of data quality, management of data domain mixtures, and adaptation from synthetic to real data.

• Track datasets, metadata, provenance, and versions so that experiments are reproducible and it is clear which data was used in each training and evaluation run.

• Oversee CI/CD and development tools for the data stack (GitHub, Python, PyTorch), while automating repetitive workflows to minimize friction.

• Analyze and enhance throughput, storage, and compute utilization across pipelines and associated assets.


⛳️ Requirements

• Strong foundational knowledge in ML and deep learning, with experience in building and managing large-scale data and/or compute systems.

• Comfortable moving between research questions and production engineering: able to explore data, run analyses, and deploy reliable systems.

• Proven research experience in data composition, data quality, and dataset releases.

• Skilled in designing and executing experiments that yield convincing, unbiased results.

• Hands-on experience with distributed processing and orchestration (e.g., Spark, Ray, Airflow, or similar tools).

• Strong proficiency in Python, along with familiarity with the tooling of modern model training workflows (datasets, checkpoints, experiment tracking).

• Strong understanding of data quality: how to measure it, monitor it, and prevent regressions as systems scale.

• Capable of thriving in a fast-paced environment, prioritizing key tasks, and communicating effectively with both researchers and engineers.

• Bonus: experience with large video datasets, dataset curation for training, or development of internal tools for evaluation/analysis in ML environments.


🏝️ Benefits

• Flexible work arrangements.
