Visual Language Model Evaluation on MM-Bench EN

MM-Bench (EN) Score: 65.8

Vanilla

[Chart: MM-Bench (EN) score over time. Y-axis ticks: 57.584, 59.717, 61.85, 63.983. X-axis: May 20, 2026.]
Updated 13d ago

Evaluation Results

Date      Score   (unlabeled)
2026.05   65.8    100
2026.05   65.4    96.4
2026.05   65      97.7
2026.05   64.2    96.1
2026.05   63.1    96.2
2026.05   61.6    86.8
2026.05   57.9    90.1