Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Math-G

Benchmarks

Task NameDataset NameSOTA ResultTrend
General Mathematics ReasoningMath-G College-math, Math-OAI, Minerva-math (test)
Accuracy54.1
24
Showing 1 of 1 rows