Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Mathematical Reasoning on MathVista (Acc@1/Acc@4)

81.9Top-1 Accuracy

GPT-5-Thinking*

61.51666.80872.177.392Apr 1, 2026
Updated 17d ago

Evaluation Results

MethodLinks
2026.04
81.986.1
2026.04
80.985.2
2026.04
77.982.4
2026.04
73.575.6
2026.04
68.272.8
2026.04
6870.8
2026.04
64.372.5
2026.04
64.169.4
2026.04
62.378.5