
Vision-language reasoning on MathVista (test)

Accuracy: 34.6

Top Probability + Confidence Modulation

[Chart: accuracy axis ticks 24.408, 27.054, 29.7, 32.346; date shown: May 27, 2026]
Updated 6d ago

Evaluation Results

Date      Accuracy
2026.05   34.6
2026.05   33.2
2026.05   29.2
2026.05   29.1
2026.05   28.8
2026.05   27
2026.05   24.8