Remotery

AI Evaluation Engineer – Planning & Operations

Posted May 23

This is a fully remote position, open to applicants in Bangladesh.

📋 Description

• Design and develop **multi-agent benchmark tasks** that encompass:

• Planning, scheduling, and resource allocation.

• Operational decision-making in areas such as logistics, project planning, incident response, and capacity planning.

• Create **constraint-rich problem statements** that feature multiple interacting variables.

• Develop **verification scripts** to assess:

• Feasibility (ensuring all constraints are satisfied).

• Completeness (confirming all requirements are met).

• Optimality (evaluating the efficiency of solutions).

• Define **task decomposition strategies** across specialized sub-agents, such as resource allocation, constraint resolution, and optimization.

• Model realistic operational systems that incorporate **dependencies, timelines, and constraints**.

• Implement validation logic and evaluation pipelines utilizing Python.

• Work within Docker environments to ensure reproducibility and execution.

• Collaborate with internal teams to enhance **task quality, coverage, and evaluation rigor**.


⛳️ Requirements

• A minimum of 5 years of experience in **operations, project management, logistics, or supply chain**.

• Strong capability to **formalize constraints, dependencies, and scheduling logic**.

• Proficiency in **Python** for developing validation and verification scripts.

• Familiarity with **optimization techniques** such as linear programming, constraint satisfaction, and scheduling algorithms.

• Excellent **structured problem-solving and decomposition abilities**.

• Experience with **AI benchmarks or evaluation frameworks** (e.g., SWE-bench or similar).

• Practical experience with **Docker** (including Dockerfiles, image builds, and debugging).

• **Nice to Have**:

• Background in **operations research or domains that heavily involve optimization**.

• Experience with **simulation or modeling tools**.

• Familiarity with **AI planning systems or automated reasoning**.

• Project management experience or relevant certifications (PMP, Agile, etc.).


🏝️ Benefits

• Competitive salary and performance-based bonuses.

• Opportunities for professional development and continuous learning.

• Flexible working hours and remote work options.

• Comprehensive health and wellness benefits.

• Collaborative and innovative work environment.

People also viewed

Anyone AI47 min ago

Physics Expert – AI Trainer

CO flagColombia OnlyPart-timeArtificial Intelligence$40/hour
ApplyView job
Sigma AI47 min ago

Indian English Linguistic Project

IN flagIndia OnlyFull-timeArtificial Intelligence
ApplyView job
Teamified47 min ago

AI Automation Specialist

PH flagPhilippines OnlyFull-timeArtificial Intelligence
ApplyView job
Stefanini Brasil47 min ago

Artificial Intelligence Analyst

BR flagBrazil OnlyFull-timeArtificial Intelligence
ApplyView job
10x.Team1 hour ago

IAM Consultant – AI Trainer – Freelance

NL flagNetherlands OnlyFreelanceArtificial Intelligence€90 – €158/hour
ApplyView job
10x.Team1 hour ago

PR Specialist – AI Trainer, Freelance

FR flagFrance OnlyFreelanceArtificial Intelligence€75 – €130/hour
ApplyView job

Never miss a great job!

Get handpicked remote jobs straight to your inbox weekly.

Trusted by 7,400+ designers