
Senior AI Engineer
Posted May 20

Posted May 20
This is a fully remote position, open to applicants in Colombia.
• Design, create, and sustain production-quality LLM integration pipelines, which encompass retrieval-augmented generation (RAG), prompt engineering, output parsing, and chain orchestration.
• Develop and manage AI functionalities within Jeeves's core financial offerings, such as spend categorization, document extraction, anomaly detection, financial Q&A, and automated reconciliation.
• Implement structured output validation, fallback handling, and confidence scoring to guarantee that AI decisions adhere to reliability standards for financial applications.
• Assess and incorporate AI frameworks and tools (LangChain, LlamaIndex, OpenAI API, Anthropic API, HuggingFace, vector databases) while advocating for the most suitable tool for each task.
• Establish prompt versioning and evaluation practices to ensure that AI outputs remain precise and consistent as models and data progress.
• Design and manage vector search pipelines utilizing databases like Pinecone, Weaviate, or pgvector to enhance semantic search and RAG-based features.
• Create document ingestion and chunking pipelines for Jeeves's financial data, processing invoices, receipts, policy documents, and transaction records.
• Improve retrieval quality through the selection of embedding models, chunk strategies, metadata filtering, and re-ranking techniques.
• Collaborate with data scientists to transition trained ML models from experimental notebooks to production serving infrastructure.
• Develop and maintain model serving endpoints while adhering to appropriate latency SLOs, input validation, and output monitoring.
• Implement model performance monitoring and data drift detection to ensure that production models maintain accuracy over time.
• Support model retraining workflows by creating clean data pipelines and feature engineering that can be continuously updated.
• Seamlessly integrate AI services with Jeeves's backend microservices, designing clear API contracts, circuit breakers, and graceful degradation patterns.
• Write high-quality, testable backend code in Python or Go/Node.js to support AI-integrated features.
• Instrument AI components with structured logging, distributed tracing, latency dashboards, and alerting to guarantee operational visibility.
• Develop human-in-the-loop review workflows for AI decisions requiring oversight, particularly for high-value financial actions.
• Collaborate with Product, Backend Engineering, and Data Science teams to define the AI roadmap and convert requirements into reliable systems.
• Foster a culture of quality by drafting design documents, reviewing peers' AI system designs, and sharing insights openly.
• Contribute to the growth of the AI engineering practice at Jeeves by establishing patterns, tools, and best practices for the broader team to utilize.
• Bachelor’s degree in Computer Science, Engineering, or a related discipline — or equivalent practical experience.
• Over 5 years of professional software engineering experience, with at least 3 years focused on AI/ML systems in a production setting.
• Hands-on experience in building and deploying LLM-powered applications using APIs like OpenAI, Anthropic, or Cohere in a production environment.
• Experience in designing and operating RAG pipelines, including chunking strategies, embedding models, and vector database integration (Pinecone, Weaviate, pgvector, or similar).
• Strong proficiency in Python for AI/ML workloads, with familiarity in at least one AI orchestration framework (LangChain, LlamaIndex, or equivalent).
• Experience with ML model serving infrastructure, including REST or gRPC inference endpoints, input/output validation, latency budgeting, and monitoring.
• Solid backend engineering foundations: REST APIs, relational databases (PostgreSQL preferred), asynchronous patterns, and cloud infrastructure (AWS, GCP, or Azure).
• Experience with observability tools such as structured logging, distributed tracing, and creating dashboards for AI system health.
• **Preferred Qualifications**
• Experience in fintech, financial services, or any regulated industry where AI reliability and auditability are essential.
• Familiarity with prompt evaluation frameworks, A/B testing of AI outputs, and monitoring model performance degradation in production.
• Experience with ML lifecycle management tools like MLflow, Weights & Biases, Vertex AI, or SageMaker.
• Knowledge of real-time data streaming (Kafka, Kinesis) for event-driven AI pipelines.
• Contributions to open-source AI tools, published technical writings, or presentations at AI/ML conferences.
• Prior experience in a startup or scale-up environment, comfortable with ambiguity and building foundational systems from scratch.
• Competitive salary and equity options.
• Flexible work schedule and remote work opportunities.
• Professional development and growth opportunities.
• Comprehensive health and wellness benefits.
• Collaborative and innovative work environment.
Credo AI
Get handpicked remote jobs straight to your inbox weekly.