Remotery

AI QA Engineer – Calidad y Evaluación de IA Generativa

atBe.Change ConsultingCO flagColombiaFull-timeQA Engineer (Quality Assurance)Mid-levelSenior

Posted 1 day ago

📋 Description

• Design, validate, and enhance evaluation frameworks for AI agents.

• Implement automated and regression testing suites for generative models.

• Define and monitor quality metrics related to: Relevance, Fidelity, Consistency, Accuracy, and Hallucinations.

• Build evaluation systems like “LLM-as-a-Judge.”

• Establish performance benchmarks for new models and existing agents.

• Validate updates for prompts, models, and RAG pipelines.

• Collaborate with AI and development teams to define acceptance criteria (pass/fail).

• Analyze evaluation results and propose continuous improvements.

• Generate metric reports and traceability regarding agent quality.


⛳️ Requirements

• Minimum of 3 years of experience in QA automation, Data/AI Quality, or evaluation of AI systems.

• Advanced experience in Python.

• Experience working with AI evaluation frameworks such as: RAGAS, DeepEval, Vertex Gen AI Evaluation Service.

• Experience in evaluating RAG systems and LLM models.

• Ability to design “LLM-as-a-Judge” systems.

• Experience in test automation and validations.

• Knowledge in: Prompt evaluation, Response quality, Model benchmarking, and Testing of generative AI.

• Familiarity with metrics such as: Groundedness, Faithfulness, Context relevance, and Answer relevance.

• Experience working with non-deterministic systems.

• Desirable: Experience in conversational AI platforms.

• Knowledge of RAG pipelines.

• Experience with generative model APIs.

• Proficiency in observability and monitoring tools.

• Knowledge in MLOps or LLMOps.

• Experience in cloud environments (GCP, AWS, or Azure).


🏝️ Benefits

• Work mode: 100% Remote

• Excellent work environment

• Opportunities for growth and participation in innovative projects.

People also viewed

Zealogics Inc54 min ago

QA Engineer, Gen AI

US flagUnited States OnlyFull-timeQA Engineer (Quality Assurance)
ApplyView job
Compass54 min ago

Senior QA Test Automation

BR flagBrazil OnlyFull-timeQA Engineer (Quality Assurance)
ApplyView job
GSB Solutions54 min ago

Senior QA Analyst – Support

MX flagMexico OnlyFull-timeQA Engineer (Quality Assurance)
ApplyView job
B2Spin Limited54 min ago

Junior QA Engineer – Operational Specialist

UA flagUkraine OnlyFull-timeQA Engineer (Quality Assurance)
ApplyView job
Clario54 min ago

QA Engineer

IN flagIndia OnlyFull-timeQA Engineer (Quality Assurance)
ApplyView job
Hammerspace54 min ago

Senior Staff Software Engineer – QA

US flagUnited States OnlyFull-timeQA Engineer (Quality Assurance)$140k – $200k/year
ApplyView job

Never miss a great job!

Get handpicked remote jobs straight to your inbox weekly.

Trusted by 7,400+ designers