Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Multimodal Reasoning on MMBench (English and Chinese Scores)

84.29MMBench Accuracy (en)

Qwen3VL-2B-SFT

23.34639.16854.9970.812Mar 10, 2025May 23, 2025Aug 5, 2025Oct 19, 2025Jan 1, 2026Mar 16, 2026May 30, 2026
Updated 1d ago

Evaluation Results

MethodLinks
2026.05
84.2981.52-
2026.05
82.7581.44-
2026.05
82.7280.49-
2026.05
82.5679.63-
2026.05
82.480.39-
2026.05
82.0179.83-
2026.05
81.3478.91-
2026.04
66.158.9-
2025.03
6658.9100
2026.04
65.757.6-
2025.03
64.856.597.9
2026.05
64.658.1100
2025.03
64.357.196.8
2026.05
63.857.699.4
2026.05
63.757.399
2026.05
63.45798.5
2026.04
63.154.5-
2026.04
63.155.8-
2026.05
63.157.198.7
2026.05
63.157.397.2
2026.05
6356.498.1
2026.05
62.957.499
2026.04
62.554.8-
2026.05
62.356.398.8
2026.05
62.356.696.5
2026.04
62.254.8-
2026.05
62.254.396.9
2026.05
6256.797.6
2026.05
6254.695.2
2026.04
61.453.8-
2026.05
61.453.993.4
2026.05
61.354.997.4
2026.05
61.25790
2025.03
6153.595.7
2025.03
60.656.7-
2025.03
60.553.995.5
2026.05
60.152.194
2026.05
6052.594.4
2025.03
58.8--
2025.03
58.248.593
2026.05
57.949.588.6
2025.03
57.250.691.8
2026.05
56.156.482.9
2026.04
55.252-
2026.05
55.145.990.4
2025.03
54.747.891.9
2025.03
54.538.1-
2026.04
53.247.4-
2026.04
52.248.5-
2026.04
5245.8-
2025.03
514890.3
2026.04
48.845.3-
2025.03
48.145.488.8
2026.05
4852.774
2025.03
3623.7-
2026.05
35.7435.09-
2025.03
31.4--
2026.05
28.8325.92-
2026.05
27.9527.95-
2026.05
25.825.29-
2026.05
25.6924.95-