Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Multimodal Benchmarking on MMBench (Language Specific Scores)

67.09MMBench Score (en)

LLaVA-JPEG-901+

45.676451.235756.79562.3543Nov 8, 2024Feb 9, 2025May 14, 2025Aug 16, 2025Nov 18, 2025Feb 20, 2026May 25, 2026
Updated 8d ago

Evaluation Results

MethodLinks
2024.11
67.0957.3
2024.11
66.6758.93
2024.11
66.3258.59
2024.11
65.6457.56
2024.11
65.3756.7
2026.03
64.959.1
2026.03
64.856.5
2026.03
64.358.3
2026.03
64.357.1
2026.03
64.354.1
2024.11
64.1754.46
2026.03
63.355.5
2026.03
63.155.8
2024.11
63.0554.2
2026.03
6153.5
2026.03
60.553.9
2026.03
58.248.5
2026.03
57.250.6
2026.05
52.645.9
2026.05
5246.5
2026.03
5148
2026.03
48.145.4
2026.05
46.539.6