Share your thoughts, 1 month free Claude Pro on usSee more

Long-horizon procedural planning on EgoPlan-Bench In-Domain

62.46Success Rate

PlanAgent + Mem.

Updated 4mo ago

Evaluation Results

Method	Links
PlanAgent + Mem. 2026.03		62.46
PlanAgent + Mem. 2026.03		56.49
GPT-5.1 2026.03		55.08
Video-LLaMA 2026.03		54.65
Video-LLaMA 2026.03		52.14
PlanAgent + Mem. 2026.03		44.68
GPT-4V 2026.03		38.4
PlanAgent (Ours) 2026.03		36.6
Video-LLaMA 2026.03		27.88