Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Multi-modal Question Answering on MMBench

86.4Accuracy

Qwen2.5vl-Instruct

24.41640.50856.672.692Nov 26, 2024Feb 24, 2025May 26, 2025Aug 25, 2025Nov 24, 2025Feb 23, 2026May 25, 2026
Updated 8d ago

Evaluation Results

MethodLinks
2025.12
86.4
2025.12
85.1
2025.10
84.19
84.02
2026.02
83.9
2026.02
83.9
2025.12
83.5
2025.10
83.41
2025.12
83.3
2025.08
83.2
82.73
2025.08
82.6
2025.08
82.1
2025.12
81.4
2026.02
80.3
2025.08
80.3
2025.08
80.2
2025.12
79
2025.08
77.3
2025.08
76.1
2025.08
75.9
2025.12
75.6
2025.08
75.6
2025.08
75.6
2026.02
75.52
2025.12
75
2026.02
74.57
2025.08
73.7
2025.08
73.5
2026.02
73.28
2026.02
71.05
2026.05
69.87
2025.08
69.8
2025.08
69.7
2026.05
69.02
2026.05
68.48
2024.11
67.9
2026.05
67.73
2026.05
66.91
2026.05
66.73
2026.05
66.43
2026.04
65.4
2026.04
64.7
2024.11
64.3
2026.04
63.9
2026.04
63.7
2026.04
63.5
2026.05
63.35
2026.04
63.3
2026.04
63.3
2026.04
63.2
2026.05
63.04
2026.04
63
2026.04
62.8
2026.04
62.5
2026.04
62.3
2026.04
62.1
2026.04
62
2026.04
61.8
2026.04
61.6
2026.05
61.24
2026.04
60.8
2026.05
60.64
2024.11
60.6
2026.04
60.1
2026.04
60
2026.04
59.6
2024.11
58.8
2026.04
58.8
2026.04
58.1
2026.05
57.54
2026.05
57.48
2026.05
57.29
2026.04
56.2
2025.12
55.5
2026.04
55.3
2026.04
55.2
2026.05
50.3
2024.11
38.2
2024.11
36
2026.02
35.14
2026.02
34.45
2026.02
34.36
2026.02
26.8