Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Multi-modal Reasoning on MathVision (test)

47.7Accuracy (%)

Nanobanana-Pro

15.04423.5223240.478Dec 8, 2025Dec 21, 2025Jan 3, 2026Jan 17, 2026Jan 30, 2026Feb 12, 2026Feb 26, 2026
Updated 25d ago

Evaluation Results

MethodLinks
2026.01
47.7
2026.01
47
2026.01
46.9
2026.01
46.9
46.1
2026.01
45.9
41.3
2026.01
39
2025.12
38.1
2026.02
32.96
2025.12
32.9
2026.02
32.9
2026.02
32.83
2025.12
32.3
2026.02
32.25
2026.02
32.11
2026.02
31.92
2026.02
31.5
2026.02
30.4
2025.12
30.2
2026.02
29.96
2026.02
29.83
2026.02
29.5
2026.02
28.2
2025.12
27.6
2025.12
27.2
2026.02
27
2025.12
26.9
2026.02
25.4
2026.02
25.4
2026.02
25.4
2025.12
25.3
2026.02
25.3
2025.12
25.1
25
2025.12
24.7
2025.12
22.2
2025.12
21.4
2025.12
21.2
2026.02
20.2
2025.12
19.7
2025.12
18.5
18.2
2025.12
17.4
2025.12
16.3