
AWS Data Science – Specialist
Posted 1 day ago

This is a fully remote position, open to applicants anywhere in the world.
• Oversee the advancement and enhancement of Feature Store functionalities, including data lineage, feature views, feature recommendation, and the integration of new query engines;
• Design and develop Apache Iceberg tables with an emphasis on optimizing read performance, versioning, and schema evolution;
• Architect and refine the serving layer utilizing Redis to deliver real-time features that meet stringent latency service level objectives (SLOs);
• Integrate and enhance Amazon EMR as a query and large-scale processing solution;
• Define and execute feature selection and transformation pipelines ensuring complete traceability;
• Establish quality standards, versioning protocols, and governance practices for features throughout the platform;
• Act as the technical authority for data and data science teams utilizing the Feature Store.
• Proficiency in feature engineering within enterprise machine learning platforms such as Feast, Tecton, Hopsworks, or similar;
• Advanced skills in Apache Spark/PySpark for extensive distributed processing tasks;
• In-depth understanding of Apache Iceberg and lakehouse architectures, including comparisons with Delta Lake and Hudi;
• Expertise in Redis for low-latency feature delivery, encompassing cache invalidation methods and effective serialization;
• Strong hands-on experience with AWS data services including S3, Glue, EMR, Redshift, and Athena;
• Preferred: familiarity with data lineage and metadata catalog systems (DataHub, OpenMetadata, Marquez) in production settings;
• Preferred: experience with Amazon EMR, particularly in cluster configuration, optimization, and Spark job tuning;
• Preferred: knowledge of MLOps practices emphasizing versioning and traceability of data artifacts;
• Preferred: previous experience in financial environments with high-cardinality, high-frequency data and compliance requirements;
• Preferred: an understanding of scalable data quality tools such as Great Expectations, Soda, and dbt tests.
• Opportunity to work in a dynamic and innovative environment;
• Access to cutting-edge technologies and tools;
• Collaborative culture that fosters professional growth and development.