Multi-modal Reasoning on MathVista (Accuracy)

79.2 Accuracy

AutoNPO

[Chart: accuracy over time, Nov 13, 2025 to Apr 22, 2026]
Updated 23d ago

Evaluation Results

Date      Accuracy
2026.04   79.2
2026.04   78.5
2026.04   77.3
2026.04   76.6
2026.04   76.3
2026.04   76.2
          73.8
2026.04   73.8
2025.11   70.3
2025.11   70.1
2025.11   69.8
2025.11   69.3
2025.11   69.3
2025.11   69.1
2025.11   69
2025.11   68.4