Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Utility Evaluation on MMbench and DocVQA (test)

87.02MMbench Score

Full Model

83.754484.602285.4586.2978May 22, 2025
Updated 1mo ago

Evaluation Results

MethodLinks
2025.05
87.0294.5190.76
2025.05
85.1591.9788.56
2025.05
85.0192.1388.57
2025.05
84.6292.988.76
2025.05
84.5592.9388.74
2025.05
83.8890.6487.26
2025.05
83.8890.6387.25