Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

MATH-Vision

Benchmarks

Task NameDataset NameSOTA ResultTrend
Mathematical ReasoningMATH-Vision
Accuracy30.4
32
Mathematical problem solvingMATH-Vision (test)
Accuracy68.8
26
Multimodal ReasoningMATH-Vision (full)
Accuracy62.7
23
GeometryMATH-Vision (mini)
Score40.79
19
Mathematical ReasoningMATH-Vision mini (test)
ALG42.11
8
Multimodal Mathematical ReasoningMATH-Vision (testmini)
Alg Score21.05
8
Showing 6 of 6 rows