Share your thoughts, 1 month free Claude Pro on usSee more

Multi-agent planning and execution on MAT2-THOR Vague Tasks

71Task Completion Rate (TCR)

Scale Plan

Updated 1mo ago

Evaluation Results

Method	Links
Scale Plan 2026.03		71	71	82
LLM as a Planner 2026.03		57	57	88
LaMMA-P (PDDL plan only) 2026.03		57	64	77
LLM+P (PDDL based) 2026.03		43	52	58
LaMMA-P (LLM corrected) 2026.03		43	50	85