Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

MathInstruct

Benchmarks

Task NameDataset NameSOTA ResultTrend
Mathematical ReasoningMathInstruct Scenario 1
Accuracy68.4
53
Mathematical ReasoningMathInstruct Scenario 4
Accuracy82.6
8
Mathematical ReasoningMathInstruct Scenario 3
Accuracy83.2
8
Mathematical ReasoningMathInstruct Scenario 2
Accuracy83.8
8
Showing 4 of 4 rows