Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

IMO-AnswerBench

Benchmarks

Task NameDataset NameSOTA ResultTrend
MathIMO-ANSWERBENCH
Score53.8
9
Mathematical ReasoningIMO-AnswerBench
Accuracy84.5
9
Mathematical ReasoningIMO-AnswerBench
Pass@183.3
8
Reasoning & GeneralIMO-AnswerBench
Score86.3
7
Mathematical ReasoningIMO-AnswerBench
Pass@125.62
6
Mathematical ReasoningIMO-AnswerBench (test)
Pass@125.62
4
Showing 6 of 6 rows