Share your thoughts, 1 month free Claude Pro on usSee more

High-level Planning on ReasonMap S (short questions)

15.44Weighted Accuracy

SFT VLM

Updated 3mo ago

Evaluation Results

Method	Links
SFT VLM 2025.11		15.44	25	3.79
Ariadne 2025.11		14.5	43	3.67
Base VLM 2025.11		13.32	26	3.73