Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Multi-step Reasoning on multi-step reasoning tasks

61.6Average Score

EmbodiedAct

43.29648.04852.857.552Feb 24, 2026
Updated 1mo ago

Evaluation Results

MethodLinks
2026.02
61.651.1
2026.02
53.333.3
2026.02
4415.6