Share your thoughts, 1 month free Claude Pro on usSee more

Multi-step Reasoning on GSM-Hard

71.5Accuracy

eMoT

Updated 1mo ago

Evaluation Results

Method	Links
eMoT 2026.06		71.5
BoT 2026.06		62
PaL 2026.06		61.2
ToT 2026.06		24.4
Qwen-32B (Direct) 2026.06		15.9