Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

NuminaMath

Benchmarks

Task NameDataset NameSOTA ResultTrend
Mathematical ReasoningNuminaMath (val)
Accuracy21.7
8
Mathematical ReasoningNuminaMath subset of 5,000 samples
Accuracy73.94
3
Mathematical ReasoningNuminaMath
Accuracy60.9
1
Showing 3 of 3 rows