
Data & AI Engineer
Posted May 23

This is a fully remote position, open to applicants in Czechia.
• Create and sustain scalable data pipelines utilizing Databricks and Azure Data Lake Storage Gen2 (ADLS Gen2), ensuring reliability and optimal performance at every stage of the pipeline.
• Apply the Medallion Architecture (Raw → Silver → Gold layers) to clean, normalize, and structure technical documentation for downstream AI utilization.
• Design and construct the Reasoning Layer using frameworks such as LangChain or LlamaIndex to oversee LLM logic, prompt routing, and tool orchestration.
• Oversee and optimize Vector Databases (e.g., Pinecone, Weaviate, or Azure AI Search) to facilitate swift and precise semantic retrieval within the corporate knowledge base.
• Create Azure Apps and Azure Functions to link the AI system with external enterprise platforms like Salesforce and SharePoint, allowing for comprehensive automation.
• 3–4 years of experience in Data Engineering or Software Development, with a strong emphasis on cloud-based data processing.
• High level of expertise in Databricks and Python; adept at navigating the entire data engineering lifecycle.
• Practical experience with LLM orchestration frameworks: LangChain, LlamaIndex, or similar.
• Strong understanding of the Azure ecosystem, including Storage (ADLS Gen2), Identity (AAD), and Serverless (Azure Functions).
• Knowledge of modern AI coding assistants (GitHub Copilot, Cursor, OpenAI Codex) to enhance development speed.
• Flexible work arrangements
• Professional development opportunities
Credo AI