Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

MathQA

Benchmarks

Task NameDataset NameSOTA ResultTrend
Mathematical reasoningMathQA
Accuracy90
95
Math Word Problem solvingMathQA (test)
Accuracy81.5
34
Mathematical ReasoningMathQA (test)
Accuracy87.6
33
Correctness PredictionMathQA
Accuracy66.15
18
Question AnsweringMathQA (test)
Accuracy81.05
16
Question AnsweringMathQA
Accuracy78.7
12
Math ProgrammingMathQA Python
Pass@8087.4
8
Zero-shot ReasoningMathQA
Accuracy28.4
7
Downstream TaskMathQA
Accuracy24.32
7
Numerical Question AnsweringMathQA (test)
Program Accuracy83
6
Common Sense ReasoningMathQA
Accuracy64
4
Code GenerationMathQA Python Original (test)
Pass@8084.7
4
Human EvaluationMathQA
Accuracy89.2
3
Code GenerationMathQA Python Filtered (dev)
PASS@120.7
3
Showing 14 of 14 rows