Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Mathematical Reasoning on MathVista mini (test)

80.2Accuracy

Qwen2.5-VL-32B + AT-RL (Ours)

23.10437.92752.7567.573Mar 27, 2024Jul 24, 2024Nov 20, 2024Mar 19, 2025Jul 16, 2025Nov 12, 2025Mar 12, 2026
Updated 1mo ago

Evaluation Results

MethodLinks
2026.02
80.2-----
2026.03
79.5---74.284.7
2026.02
78.3-----
2026.02
77.8-----
2026.02
75.7-----
2026.03
74.1---65.381.7
2026.02
73.4-----
2026.03
73.3---65.780.1
2026.01
70.8-----
2026.02
70.7-----
2026.03
70.7---69.374.7
2026.01
70.3-----
2026.03
69.8---68.473.7
2026.01
69.2-----
2026.03
68.9-----
2026.01
68.6-----
2026.01
68.2-----
2026.02
67.7-----
2024.08
67.5-----
2024.10
66.1-----
2026.03
64.7---62.366.8
2026.03
64.1---61.766.2
63.9-----
2024.08
63.8-----
2024.10
63.8-----
2024.08
63.2-----
2026.01
62.6-----
2026.01
62.2-----
2026.01
62-----
2026.02
60-----
58.4-----
58-----
54.7-----
2024.05
52.1-----
2024.05
51.6-----
51.5-----
2024.05
51.4-----
2025.05
51.1-----
2024.03
49.9-----
2024.08
49.9-----
2025.05
47.9-----
2024.05
47.9-----
2025.05
47.8-----
2024.03
46.5-----
2024.05
46.5-----
2024.05
46.4-----
2024.03
45.2-----
2024.05
45.2-----
2024.03
43.3-----
2024.03
43.3-----
2024.05
43.3-----
2024.03
43.1-----
2024.03
41.8-----
2024.05
39.4-----
2024.03
38.9-----
2024.03
37-----
2024.03
37-----
2024.05
36.1-----
2024.05
35.9-----
2024.03
35.3-----
2024.05
35.3-----
2024.08
34.8-----
2024.03
34.6-----
2024.03
34.5-----
2024.03
32.2-----
2024.03
31.4-----
2024.06
30.6-----
2024.06
30.2-----
2024.06
30.2-----
2024.03
29.4-----
2024.06
27.7-----
2024.03
27.6-----
2024.06
27.6-----
2024.06
27.5-----
2024.03
25.3-----
2026.02
-52.661.650.8--
2026.02
-55.257.955.5--
2026.02
-48.868.837.5--
2026.02
-53.368.142.1--
2026.02
-60.465.161.6--
2026.02
-60.665.961.3--
2026.02
-50.150.450.3--
2026.02
-50.449.651.1--
2026.02
-35.838.636--
2026.02
-36.538.136.2--