
AI Implementation Quality Analyst
Posted Jun 21

Posted Jun 21
This is a fully remote position, open to applicants in United States.
β’ Collaborate closely with clients during implementation to comprehend use cases, success metrics, and risk tolerance.
β’ Convert client specifications into evaluation frameworks, prompt strategies, and testing coverage.
β’ Serve as the quality intermediary between the client, product team, and engineering to ensure consistency both pre- and post-launch.
β’ Assess and score AI responses using established rubrics.
β’ Validate against sources/ground truth; identify hallucinations, omissions, and other potential risks.
β’ Document observations and calibrate with colleagues to maintain consistent scoring practices.
β’ Develop and sustain prompt banks and golden sets (expected outcomes).
β’ Broaden coverage for edge cases, high-risk situations, and authentic user language.
β’ Monitor regression/drift and provide data for dashboards and quality reports.
β’ Prioritize internal and client feedback; distill themes across various deployments.
β’ Recognize systemic risks and escalate significant findings with clear, client-relevant context.
β’ Collaborate with product and engineering teams to validate and confirm fixes, ensuring alignment with client expectations.
β’ Strong analytical judgment and a keen attention to detail.
β’ Exceptional written communication skills, including the ability to articulate reasoning clearly.
β’ Experience in reviewing, auditing, or assessing structured outputs such as content, decisions, or recommendations.
β’ Comfort with applying detailed guidelines and rubrics consistently at scale.
β’ Familiarity with large language models and common failure modes like hallucinations, overgeneralization, or unsafe responses.
β’ Proficiency in using spreadsheets, evaluation tools, or annotation platforms.
β’ Experience with AI evaluation, data annotation, quality assurance, trust and safety, or policy review.
β’ Exposure to human-in-the-loop workflows or benchmarking procedures.
β’ Domain expertise in a regulated or high-risk sector such as government, education, healthcare, or legal services.
β’ Experience in contributing to test suites, evaluation dashboards, or quality reporting.
β’ Ability to work collaboratively across functions with product and engineering teams.
β’ Experience in client-facing roles such as implementation, consulting, or solution delivery in a SaaS or regulated environment.
β’ Capacity to translate between business/user needs and technical system behaviors.
β’ Flexible Time Off β Take the time you need to rest, recharge, and live your life.
β’ Company-Wide Wellbeing Days β Paid days off to unplug and focus on your mental health.
β’ Work From Home Reimbursement β Support a productive home office environment.
β’ Multiple Health Plan Options β Including a 100% employer-paid plan.
β’ Employer HSA Contributions β When enrolled in a High-Deductible Health Plan.
β’ Fitness Reimbursement Program β Stay active, your way.
β’ On-Demand Mental Health Support β Access to Headspace and other wellness tools.
β’ Paid Parental Leave β For both birthing and non-birthing parents.
β’ Traditional & Roth 401(k) β With a generous company match.
β’ Life & AD&D Insurance β 100% employer-paid coverage for peace of mind.
β’ Online Learning Platforms β Fuel your professional development.
β’ Competitive Salary & Bonuses β Your contributions are valued and rewarded.
Jedox
Sentara Health
EverAI
SHI International Corp.
Get handpicked remote jobs straight to your inbox weekly.