Remotery

Technical Staff Member – Data Intelligence

Posted May 6

This is a fully remote position, open to applicants in the United States.

📋 Description

• Collaborate with model researchers to establish the definition of “good data” for our models, which includes quality metrics, validation checks, and acceptance thresholds.

• Investigate open-source datasets and develop the internal datasets best suited to constructing fundamental World Models.

• Create algorithms for automated evaluation of data quality, data domain mixtures, and the adaptation of synthetic data to real data.

• Track datasets, metadata, provenance, and versions so that experiments are reproducible and it is clear which data was used in each training and evaluation run.

• Oversee CI/CD and development tools for the data stack (GitHub, Python, PyTorch), and automate repetitive tasks to streamline workflows.

• Evaluate and enhance throughput, storage, and compute utilization across pipelines and associated assets.


⛳️ Requirements

• Strong foundational knowledge in ML and deep learning, with experience in building and managing large-scale data and/or computing systems.

• Comfortable moving between research questions and production engineering: able to explore data, conduct analyses, and deploy reliable systems.

• Proven research experience related to data composition, data quality, and dataset releases.

• Skill in designing and executing experiments that yield convincing and unbiased results.

• Practical experience with distributed processing and orchestration tools (such as Spark, Ray, Airflow, or similar alternatives).

• Proficient in Python and familiar with the tools used in contemporary model training workflows (datasets, checkpoints, experiment tracking).

• Strong understanding of data quality: how to measure it, monitor it, and prevent regressions as systems scale.

• Capable of thriving in a dynamic environment, prioritizing effectively, and communicating clearly with both researchers and engineers.

• Bonus: experience with large video datasets, dataset curation for training purposes, or developing internal tools for evaluation/analysis in ML environments.


🏝️ Benefits

• Flexible work arrangements
