
Staff Data Engineer
Posted 6 days ago

Posted 6 days ago
This is a fully remote position, open to applicants in Colombia.
• Define the architecture for data and develop platform strategies, spearheading design across pipelines, warehouses, and data lakes.
• Construct and enhance scalable data pipelines that support both batch and real-time processing.
• Establish and uphold data governance, quality standards, and compliance frameworks throughout the platform.
• Develop monitoring, logging, and alerting systems for data pipelines and services, while contributing to CI/CD workflows for data deployment and automation.
• Propel the modernization of the data platform, focusing on performance, cost-effectiveness, and scalability.
• Adopt an AI-centric approach in your daily tasks, utilizing tools such as Claude, Cursor, and other contemporary AI assistants to deliver superior quality work efficiently.
• Design and execute data contracts and event flows in partnership with backend, platform, and engineering teams.
• Oversee the design and execution of data pipelines for production AI/ML systems, including embeddings, vector stores, RAG data preparation, feature stores, and training/inference data flows.
• Integrate data services with APIs, middleware, and external systems to facilitate comprehensive data consumption.
• Collaborate with leadership on data strategy, translating technical intricacies into actionable decisions for others.
• Work closely with engineering, analytics, AI, and product teams to ensure data platforms align with broader organizational objectives.
• Champion data quality, governance, and best practices for platforms across teams and initiatives.
• Set data engineering standards that enhance the quality and consistency of outputs across the team.
• Mentor junior and mid-level engineers, aiding in their professional growth, confidence, and impact.
• Make pivotal architectural decisions with clear accountability and consideration of long-term implications.
• A minimum of 7 years of professional experience in data engineering, with a focus on leading complex data platform projects.
• Strong background in system architecture with specialization in distributed data systems.
• Expert-level proficiency in Python, Scala, and SQL.
• Extensive knowledge of cloud-native data platforms and enterprise data warehousing.
• Strong expertise in orchestrating data pipelines and processing.
• Significant experience with streaming platforms and real-time data processing (e.g., Kafka, Kinesis, Pub/Sub).
• Profound data modeling skills and experience in data transformation.
• Solid experience with data quality, governance, and compliance frameworks.
• Strong background in container orchestration and CI/CD practices for data systems.
• Proven experience in building data pipelines for production AI/ML systems, including embeddings, vector stores, RAG data preparation, feature stores, and training/inference data flows.
• Demonstrated leadership and technical mentoring experience within a team or organization.
• Excellent communication skills with the ability to convey complex technical concepts to diverse audiences.
• Regular, hands-on use and expert understanding of AI-enhanced coding tools such as Claude and Cursor.
• Exceptional problem-solving abilities and the capacity to navigate complex technical and business challenges with sound judgment.
• Experience with data mesh or data fabric concepts, lakehouse architectures, or governance framework implementation is a plus.
• Competitive salary.
• Flexible working hours.
• Opportunities for professional development.
Aimpoint Digital
Power Digital Marketing
Get handpicked remote jobs straight to your inbox weekly.