Remotery

AI QA Engineer – Calidad y Evaluación de IA Generativa

Posted May 14

This is a fully remote position, open to applicants in Colombia.

📋 Description

• Design, validate, and enhance evaluation frameworks for AI agents.

• Implement automated and regression testing suites for generative models.

• Define and monitor quality metrics related to: Relevance, Fidelity, Consistency, Accuracy, and Hallucinations.

• Build evaluation systems like “LLM-as-a-Judge.”

• Establish performance benchmarks for new models and existing agents.

• Validate updates for prompts, models, and RAG pipelines.

• Collaborate with AI and development teams to define acceptance criteria (pass/fail).

• Analyze evaluation results and propose continuous improvements.

• Generate metric reports and traceability regarding agent quality.


⛳️ Requirements

• Minimum of 3 years of experience in QA automation, Data/AI Quality, or evaluation of AI systems.

• Advanced experience in Python.

• Experience working with AI evaluation frameworks such as: RAGAS, DeepEval, Vertex Gen AI Evaluation Service.

• Experience in evaluating RAG systems and LLM models.

• Ability to design “LLM-as-a-Judge” systems.

• Experience in test automation and validations.

• Knowledge in: Prompt evaluation, Response quality, Model benchmarking, and Testing of generative AI.

• Familiarity with metrics such as: Groundedness, Faithfulness, Context relevance, and Answer relevance.

• Experience working with non-deterministic systems.

• Desirable: Experience in conversational AI platforms.

• Knowledge of RAG pipelines.

• Experience with generative model APIs.

• Proficiency in observability and monitoring tools.

• Knowledge in MLOps or LLMOps.

• Experience in cloud environments (GCP, AWS, or Azure).


🏝️ Benefits

• Work mode: 100% Remote

• Excellent work environment

• Opportunities for growth and participation in innovative projects.

People also viewed

Pennant11 hours ago

Quality Assurance Registered Nurse, Home Health

US flagCalifornia OnlyFull-timeQA Engineer (Quality Assurance)
ApplyView job
UL Solutions11 hours ago

Research Scientist III – QA-QC, Fire Safety

US flagUnited States OnlyFull-timeQA Engineer (Quality Assurance)$89.6k – $123.2k/year
ApplyView job
BMO U.S.11 hours ago

Penetration Tester

US flagTexas OnlyFull-timeQA Engineer (Quality Assurance)$88.8k – $165.6k/year
ApplyView job
US Anesthesia Partners11 hours ago

Anesthesia Coding QA Specialist III

US flagTexas OnlyFull-timeQA Engineer (Quality Assurance)$60.8k – $103.4k/year
ApplyView job
Parallax Creative11 hours ago

Rhino/Revit BIM QA Lead

US flagUnited States OnlyFreelanceQA Engineer (Quality Assurance)$400 – $600/year
ApplyView job
Empower11 hours ago

Automation Quality Engineer

US flagUnited States OnlyFull-timeQA Engineer (Quality Assurance)$72.2k – $102k/year
ApplyView job

Never miss a great job!

Get handpicked remote jobs straight to your inbox weekly.

Trusted by 7,400+ designers