
AI Evaluation Engineer – Business & Operations Domain
Posted Jun 4

Posted Jun 4
This is a fully remote position, open to applicants in Egypt.
• Design realistic business and operational workflow scenarios for AI evaluation systems.
• Create structured tasks that encompass analytics, reporting, operational reasoning, and process optimization.
• Develop precise task specifications, anticipated outcomes, and validation logic.
• Identify operational edge cases, bottlenecks, and scenarios of workflow failure.
• Assess AI-generated outputs for reasoning quality, relevance, and accuracy.
• Contribute expertise in business operations, analytics, automation, or operational systems.
• Review and enhance workflow complexity, clarity, and evaluation quality.
• Collaborate with reviewers and researchers to refine AI benchmark scenarios.
• Assist in creating realistic multi-step business and operational problem-solving tasks.
• 3–10 years of experience in operations, analytics, consulting, business systems, or equivalent fields.
• Strong analytical thinking and operational problem-solving capabilities.
• Experience with operational workflows, reporting systems, CRM tools, or business analytics.
• Solid understanding of cross-functional business processes and their interdependencies.
• Proficiency in spreadsheets, dashboards, operational reporting, or workflow automation.
• Excellent written communication and documentation abilities.
• Familiarity with AI systems, automation platforms, or evaluation workflows is preferred.
• Capability to design realistic and structured operational scenarios for evaluation purposes.
• Competitive salary and performance-based bonuses.
• Opportunities for professional development and career advancement.
• Flexible work hours and remote work options.
• Comprehensive health and wellness benefits.
• Engaging work environment with a focus on innovation and collaboration.
10x.Team
10x.Team
Anyone AI
Anyone AI
Get handpicked remote jobs straight to your inbox weekly.