
Multimodal Reasoning on MMBench

Accuracy: 87

Qwen3-VL-8B-Thinking

[Chart: accuracy by date, Feb 21, 2024 to Feb 18, 2026]
Updated 3d ago

Evaluation Results

Date      Accuracy   Second score
2026.02   87         -
2026.02   82.9       -
2024.08   82.7       -
2024.08   82.5       -
2024.09   80.4       -
2024.09   79.2       -
2024.09   76.6       -
2024.09   76.1       -
2024.09   72.7       -
2024.09   70.8       -
2026.02   70.7       -
2024.08   69.3       -
2024.09   69.1       -
2024.08   68.7       -
2024.09   68.6       -
2024.09   68         -
2024.09   67.8       -
2024.09   66.9       -
2024.09   66.9       -
2024.09   66.5       -
2024.08   66.1       -
2025.12   65.9       -
2025.12   65.2       -
2024.09   64.6       -
2024.09   64.6       -
2025.12   64.5       -
2025.12   64.4       -
2025.12   64.4       -
2024.08   64.3       -
2025.12   64.3       -
2025.12   64.2       -
2024.09   64.1       -
2025.12   64.1       -
2024.09   64         -
2025.12   64         -
2025.12   63.8       -
2024.09   63.2       -
2024.09   62.8       -
2024.09   59.8       -
2024.09   59.8       -
2024.02   59.78      -
2024.09   59.7       -
2024.09   59.6       -
2024.02   59.1       -
2024.09   57.7       -
2024.02   57.06      -
2024.02   56.46      -
2024.09   53.2       -
2024.09   52.1       -
2024.02   50.85      -
2023.11   -          64.5
2023.11   -          61.2
2023.11   -          36
2023.11   -          25.3
2023.11   -          40.3
2023.11   -          67
2023.11   -          66
2023.11   -          70.7
2023.11   -          65.9
2023.11   -          65.5
2023.11   -          68.3
2023.11   -          75.5
2023.11   -          74.8
2023.11   -          76.3
2024.02   -          67.4
2024.02   -          60.1
2024.02   -          65.4
2024.02   -          65.1
2024.02   -          67.6
2024.02   -          66.8
2025.01   -          82.4
2025.01   -          75.4
2025.01   -          71.5
2025.01   -          76.9
2025.01   -          77.4
2025.01   -          76.3
2025.01   -          75.7
2025.01   -          76.8
2025.01   -          75.3
2025.01   -          75.34
2025.01   -          75
2025.01   -          65.8
2025.01   -          79.89
2025.12   -          79.96
2025.12   -          80.56
2025.12   -          82.59
2025.12   -          83.36