Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Multimodal Understanding on MMBench (dev)

80.41Accuracy

Qwen2.5-VL-3B

34.223646.214358.20570.1957Feb 18, 2024Jun 5, 2024Sep 21, 2024Jan 7, 2025Apr 25, 2025Aug 11, 2025Nov 28, 2025
Updated 3d ago

Evaluation Results

MethodLinks
2025.11
80.41--
2025.11
79.81--
2025.11
78.18--
2025.11
77.58--
2024.02
77.2--
2024.02
76.7--
2024.02
75.1--
2024.02
74.9--
2024.11
74.9--
2024.02
74.2--
2024.02
73.3--
2024.02
73.2--
2024.02
72.3--
2024.02
72.1--
2024.02
70--
2024.02
69.6--
2024.11
69.4--
2025.11
69.24--
2024.02
68.6--
2024.02
68.6--
2024.02
68.5--
2024.02
68.5--
2024.02
68--
2024.02
67.8--
2024.02
67.4--
2024.02
66.5--
2024.02
66.1--
2025.09
64.7--
2024.11
64.3--
2024.02
63.4--
2024.02
63.2--
2024.11
63.2--
2025.09
63--
2025.09
62.89--
2025.09
62.54--
2025.09
62.5--
2025.09
62.03--
2025.09
62--
2025.09
61.86--
2025.09
61.2--
2024.11
60.6--
2025.09
60.22--
2025.09
60.1--
2025.09
60--
2024.11
59.8--
2024.11
59.6--
2025.09
59.28--
2024.02
58.8--
2024.02
57.9--
2024.11
57.7--
2025.09
56.2--
2025.09
56.1--
2024.11
53.2--
2024.11
48.2--
2025.09
48--
2024.11
38.7--
2024.02
36--
2024.11
36--
2024.03
-0.588-
2024.03
-0.677-
-0.606-
2024.03
-0.659-
2024.03
-0.716-
2026.02
-75.8-
2026.02
-76.9-
2026.02
-71.7-
2026.02
-81-
2026.02
-82.5-
2026.02
-64.7-
2026.02
-75.5-
2026.02
-77.4-
2026.02
-78.3-
2026.02
-76.4-
2026.02
-81.9-
2026.02
-84-
2026.02
-67.9-
2025.11
-55.76-
2025.11
-69.16-
2025.11
-51.29-
2025.11
-70.79-