Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Multimodal Math Reasoning on MathVista (avg@8 accuracy)

69.53Avg@8 Accuracy

PAPO_G

52.94257.248561.55565.8615Jul 8, 2025
Updated 3d ago

Evaluation Results

MethodLinks
2025.07
69.53
2025.07
67.53
2025.07
65.48
2025.07
62.53
2025.07
61.91
2025.07
61.38
2025.07
60.89
2025.07
59.34
2025.07
56.08
2025.07
53.58