Remotery

AI Engineer, Data Pipeline

Posted May 7

This is a fully remote position, open to applicants in India.

📋 Description

• Develop data ingestion pipelines to extract and transform enterprise data.

• Execute data cleansing and normalization processes.

• Create and manage ETL jobs utilizing Spark/PySpark on cloud platforms.

• Enforce data validation and quality assurance measures at every stage of the pipeline.

• Construct automated data export jobs for datasets used in model training.

• Assist with feature extraction from enterprise schemas.

• Oversee pipeline health, diagnose failures, and enhance performance.

• Maintain thorough documentation of data lineage, schemas, and transformation logic.


⛳️ Requirements

• Minimum of 3 years of experience in software engineering.

• Proficient in Python and data processing tools (such as pandas, PySpark, or their equivalents).

• Knowledge of SQL and relational databases (including MySQL, PostgreSQL).

• Familiarity with cloud data services (such as object storage, managed Spark, managed ETL, or similar).

• Comprehension of ETL/ELT methodologies and data pipeline architecture.

• Experience with various data formats (including Parquet, JSON, Avro).

• Strong focus on data quality and testing practices.

• Bachelor’s degree in Computer Science or equivalent experience.


🏝️ Benefits

• Pioneering Technology: At Coupa, we are leading the way in innovation, utilizing advanced technology to provide our customers with enhanced efficiency and visibility in their spending.

• Collaborative Culture: We emphasize teamwork and collaboration, fostering a culture characterized by transparency, openness, and a collective commitment to excellence.

• Global Impact: Become part of an organization where your contributions have a worldwide, measurable effect on our clients, the business, and one another.

People also viewed

Agiloft2 hours ago

AI Data Platform Lead

CA flagCanada OnlyFull-timeData Engineer
ApplyView job
Oscilar2 hours ago

Data Engineer

BR flagBrazil OnlyFull-timeData Engineer
ApplyView job
HubSpot2 hours ago

Senior Product Manager, Events Data Platform

US flagUnited States OnlyFull-timeData Engineer$140k – $175k/year
ApplyView job
Prima3 hours ago

Technical Product Manager – Data Platform

IT flagItaly OnlyFull-timeData Engineer
ApplyView job
Newfire Global Partners3 hours ago

Senior Director, Clinical Data Engineering

US flagMassachusetts OnlyFull-timeData Engineer$229k – $280k/year
ApplyView job
Latino Legends3 hours ago

Senior Data Engineer

AR flagArgentina OnlyFull-timeData Engineer$6,000 – $8,500/month
ApplyView job

Never miss a great job!

Get handpicked remote jobs straight to your inbox weekly.

Trusted by 7,400+ designers