
Data Engineer, LATAM - Python, PySpark, AWS Glue, Amazon Athena, SQL, Apache Airflow
Posted May 24

This is a fully remote position, open to applicants in Brazil.
• Develop, enhance, and scale data pipelines and infrastructure utilizing Python, TypeScript, Apache Airflow, PySpark, AWS Glue, and Snowflake.
• Create, operationalize, and oversee data ingestion and transformation workflows: DAGs, alerting, retries, SLAs, lineage, and cost management.
• Work collaboratively with platform and AI/ML teams to automate ingestion, validation, and real-time computing workflows; aim towards establishing a feature store.
• Incorporate pipeline health and metrics into engineering dashboards to ensure complete visibility and observability.
• Structure data and execute efficient, scalable transformations in Snowflake and PostgreSQL.
• Develop reusable frameworks and connectors to standardize the internal publishing and consumption of data.
• Over 4 years of experience in production data engineering.
• Extensive, hands-on expertise with Apache Airflow, AWS Glue, PySpark, and Python-based data pipelines.
• Strong SQL capabilities and experience managing PostgreSQL in live environments.
• Comprehensive understanding of cloud-native data workflows (preferably AWS) and pipeline observability (metrics, logging, tracing, alerting).
• Demonstrated experience managing pipelines from start to finish: design, implementation, testing, deployment, monitoring, and iteration.
• Compensation will be provided in USD.
• Work hours are set according to the EST time zone (9 AM to 6 PM EST) or PT time zone.