
Data Engineer
Posted Jun 12

Posted Jun 12
This is a fully remote position, open to applicants in Indonesia.
• **Data Engineering**
• - Develop Python scripts to gather and process data from diverse academic, governmental, or commercial sources to support Meridia’s product development.
• - Ensure the accuracy and high quality of metadata on the company's internal datasets through the use of scripts, automation, and selected tools.
• - Create internal tools and scripts to automate data processing while utilizing technologies such as Airflow, AWS, and Google Cloud to enhance and scale ETL tasks for geospatial data.
• - Manage the team’s git repository of ETL scripts and follow version control best practices to guarantee that code is well-organized, accessible, and reproducible for the entire team.
• - Improve data ingestion processes to ensure reliable, timely, and scalable delivery of data into downstream products and analytics workflows, especially for large-scale raster and vector datasets.
• - Monitor ingestion pipelines with logging, alerting, and metrics to identify failures or data inconsistencies.
• - Contribute to our internal data catalogs, manuals, dictionaries, glossaries, and a broader set of terminology in a consistent manner across countries, products, and projects.
• - Promote continuous enhancements in operational efficiency by pinpointing bottlenecks and implementing effective solutions.
• **Core Data Engineering Skills**
• - Proficient in constructing, optimizing, and maintaining scalable ETL/data pipelines, particularly for large and intricate geospatial datasets, utilizing modern orchestration and workflow tools like Apache Airflow.
• - Experienced in cloud platforms (e.g., AWS), with practical knowledge in architecting data storage, transformation, and metadata solutions securely and cost-effectively using Infrastructure as Code tools such as Terraform.
• - Adept in Python and relevant data engineering frameworks (e.g., Pandas, Pydantic, Rasterio, Tippecano) for automating data ingestion, cleaning, standardization, and geospatial transformation.
• - Strong understanding of data governance and best practices in version control (e.g., Git), documentation, data quality assurance, reproducibility, and collaboration within cross-functional data teams.
• **Learning and Development**
• - The ideal candidate should be enthusiastic and motivated to learn and develop in the role. You will participate in on-the-job training and development to gain insights into the land rights sector and our data systems and workflows. Mentorship will be provided by our Data Engineering team in Indonesia and senior colleagues in the Netherlands.
• **The ideal candidate has/is**
• - Over 4 years of experience in a related role.
• - Strong problem-solving and analytical abilities.
• - Capability to efficiently collect and organize data from various sources.
• - Knowledge of data management and data entry processes.
• - Self-driven, proactive, accountable, and a dependable team player.
• - Full working proficiency in English.
• **The following are an advantage:**
• - Familiarity with GIS scripting (Python, Geopandas, PostGIS) and GIS software such as QGIS ETL experience.
• - Proven projects or case studies related to forest monitoring or land-use analysis.
• - Experience with data version control tools like DVC.
• - Experience in a start-up environment.
• **The benefits package includes**
• - Competitive salary aligned with your skills and competencies.
• - A four-day workweek.
• - A remote working opportunity.
• - The chance to be part of a rapidly growing impact venture with a casual yet professional work culture.
• - Provision of a laptop and an external monitor for your work.
• - An annual employee-led Learning & Development budget.
• - Engaging with team members, clients, and users across various countries.
Aimpoint Digital
Power Digital Marketing
Get handpicked remote jobs straight to your inbox weekly.