Remotery

AWS Data Science – Specialist

Posted 1 day ago

This is a fully remote position, open to applicants anywhere in the world.

📋 Description

• Lead the evolution of the Feature Store, including data lineage, feature views, feature recommendation, and the integration of new query engines;

• Design and develop Apache Iceberg tables with an emphasis on optimizing read performance, versioning, and schema evolution;

• Architect and refine the serving layer utilizing Redis to deliver real-time features that meet stringent latency service level objectives (SLOs);

• Integrate and enhance Amazon EMR as a query and large-scale processing solution;

• Define and execute feature selection and transformation pipelines ensuring complete traceability;

• Establish quality standards, versioning protocols, and governance practices for features throughout the platform;

• Act as the technical authority for data and data science teams utilizing the Feature Store.


⛳️ Requirements

• Proficiency in feature engineering within enterprise machine learning platforms such as Feast, Tecton, Hopsworks, or similar;

• Advanced skills in Apache Spark/PySpark for extensive distributed processing tasks;

• In-depth understanding of Apache Iceberg and lakehouse architectures, including comparisons with Delta Lake and Hudi;

• Expertise in Redis for low-latency feature delivery, encompassing cache invalidation methods and effective serialization;

• Strong hands-on experience with AWS data services including S3, Glue, EMR, Redshift, and Athena;

• Preferred: familiarity with data lineage and metadata catalog systems (DataHub, OpenMetadata, Marquez) in production settings;

• Preferred: experience with Amazon EMR, particularly in cluster configuration, optimization, and Spark job tuning;

• Preferred: knowledge of MLOps practices emphasizing versioning and traceability of data artifacts;

• Preferred: previous experience in financial environments with high-cardinality, high-frequency data and compliance requirements;

• Preferred: an understanding of scalable data quality tools such as Great Expectations, Soda, and dbt tests.


🏝️ Benefits

• Opportunity to work in a dynamic and innovative environment;

• Access to cutting-edge technologies and tools;

• Collaborative culture that fosters professional growth and development.
