This is a fully remote position, open to applicants in Colombia.

📋 Description

• Define the data architecture and platform strategy, leading design across pipelines, warehouses, and data lakes.

• Construct and enhance scalable data pipelines that support both batch and real-time processing.

• Establish and uphold data governance, quality standards, and compliance frameworks throughout the platform.

• Develop monitoring, logging, and alerting systems for data pipelines and services, while contributing to CI/CD workflows for data deployment and automation.

• Drive the modernization of the data platform, focusing on performance, cost efficiency, and scalability.

• Take an AI-first approach to your daily work, using tools such as Claude, Cursor, and other current AI assistants to deliver high-quality work efficiently.

• Design and execute data contracts and event flows in partnership with backend, platform, and engineering teams.

• Oversee the design and execution of data pipelines for production AI/ML systems, including embeddings, vector stores, RAG data preparation, feature stores, and training/inference data flows.

• Integrate data services with APIs, middleware, and external systems to facilitate comprehensive data consumption.

• Collaborate with leadership on data strategy, translating technical complexity into decisions that others can act on.

• Work closely with engineering, analytics, AI, and product teams to ensure data platforms align with broader organizational objectives.

• Champion data quality, governance, and platform best practices across teams and initiatives.

• Set data engineering standards that enhance the quality and consistency of outputs across the team.

• Mentor junior and mid-level engineers, supporting their professional growth, confidence, and impact.

• Make pivotal architectural decisions with clear accountability and consideration of long-term implications.

⛳️ Requirements

• A minimum of 7 years of professional experience in data engineering, with a focus on leading complex data platform projects.

• Strong background in system architecture with specialization in distributed data systems.

• Expert-level proficiency in Python, Scala, and SQL.

• Extensive knowledge of cloud-native data platforms and enterprise data warehousing.

• Strong expertise in data pipeline orchestration and data processing.

• Significant experience with streaming platforms and real-time data processing (e.g., Kafka, Kinesis, Pub/Sub).

• Deep data modeling skills and experience with data transformation.

• Solid experience with data quality, governance, and compliance frameworks.

• Strong background in container orchestration and CI/CD practices for data systems.

• Proven experience in building data pipelines for production AI/ML systems, including embeddings, vector stores, RAG data preparation, feature stores, and training/inference data flows.

• Demonstrated leadership and technical mentoring experience within a team or organization.

• Excellent communication skills with the ability to convey complex technical concepts to diverse audiences.

• Regular, hands-on use of AI-assisted coding tools such as Claude and Cursor, with an expert understanding of how to apply them.

• Exceptional problem-solving abilities and the capacity to navigate complex technical and business challenges with sound judgment.

• Experience with data mesh or data fabric concepts, lakehouse architectures, or governance framework implementation is a plus.

🏝️ Benefits

• Competitive salary.

• Flexible working hours.

• Opportunities for professional development.

Staff Data Engineer


