
Principal AI/ML Researcher – Reasoning, Planning, Decision-making Systems
Posted 6 days ago

Posted 6 days ago
This is a fully remote position, open to applicants in United States.
• Lead foundational and applied research in reasoning engines, planning architectures, and large-scale decision-making frameworks.
• Enhance techniques in LLM/LRM post-training, reinforcement learning-based decision-making, and knowledge-integrated agents.
• Develop methodologies for plan induction, value estimation, and contingency modeling within intelligent agents.
• Investigate and validate protocols for distributed reasoning and collaborative planning among cooperative agents in multi-agent systems.
• Design RPD systems that combine post-trained LLMs/LRMs, graph-structured memory (e.g., KGs), and RL-driven controllers.
• Create recursive task planners, search-based or policy-based reasoners, and belief-state trackers that can function with large model substrates.
• Ensure modularity and extensibility through multi-agent frameworks, agentic substrates, and declarative planning pipelines.
• Establish communication protocols, coordination strategies, and cross-agent knowledge alignment mechanisms to promote emergent cooperative intelligence.
• Master’s degree or equivalent in Computer Science, AI, Cognitive Science, or related disciplines.
• Recent publications or patents in AI, Cognitive Science, or related fields.
• Over 15 years of experience in AI/ML, including post-training architectures and production-scale reasoning systems.
• Advanced coding skills in Java, Python, C++, or similar languages, with experience in ML/RL frameworks (e.g., PyTorch, Ray, JAX, RLlib) at scale.
• Demonstrated experience in integrating LLMs/LRMs with Knowledge Graphs or structured world models.
• Profound understanding of Reinforcement Learning and its applications in decision-making and planning.
• Proficiency in hybrid model architectures: connectionist-symbolic fusion, retrieval-based agents, or goal-directed transformers.
• Background in multi-agent coordination, distributed RL, or cooperative inference systems.
• This position may also qualify for bonuses, equity, benefits, and Employee Travel Credits.
Clariti
Aledade, Inc.
Geomagical Labs
Slingshot Aerospace
Get handpicked remote jobs straight to your inbox weekly.