
Data Engineer β Python, PySpark, AWS Glue, Amazon Athena, SQL, Apache Airflow
Posted May 31

Posted May 31
This is a fully remote position, open to applicants in Pakistan.
β’ Develop, enhance, and expand data pipelines and infrastructure utilizing Python, TypeScript, Apache Airflow, PySpark, AWS Glue, and Snowflake.
β’ Design, implement, and oversee ingestion and transformation workflows, including DAGs, alert systems, retries, SLAs, lineage, and cost management.
β’ Partner with platform and AI/ML teams to automate ingestion, validation, and real-time computing workflows, aiming towards a feature store.
β’ Incorporate pipeline health metrics into engineering dashboards to ensure comprehensive visibility and observability.
β’ Model data and execute efficient, scalable transformations within Snowflake and PostgreSQL.
β’ Create reusable frameworks and connectors to standardize the internal processes of data publishing and consumption.
β’ A minimum of 4 years of hands-on experience in production data engineering.
β’ Extensive practical knowledge of Apache Airflow, AWS Glue, PySpark, and Python-based data pipelines.
β’ Proficient SQL skills and experience in operating PostgreSQL within live environments.
β’ Strong grasp of cloud-native data workflows (preferably AWS) and pipeline observability (including metrics, logging, tracing, and alerting).
β’ Demonstrated experience in managing pipelines throughout their entire lifecycle: design, implementation, testing, deployment, monitoring, and iteration.
β’ Experience in performance tuning and cost optimization with Snowflake is preferred.
β’ Preferred experience with real-time or near-real-time processing.
β’ Hands-on experience with a backend TypeScript framework is a significant advantage.
β’ Experience with data quality frameworks, contract testing, or schema management is a plus.
β’ A background in developing internal developer platforms or components of data platforms is beneficial.
β’ Fully remote position.
β’ Compensation in USD.
β’ Work hours aligned with the EST time zone (9 AM to 6 PM EST) or PT time zone.
Confitec
DOMVS iT
Anyone AI
FCamara Consulting & Training
Get handpicked remote jobs straight to your inbox weekly.