Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Multimodal Evaluation on MMBench CN

74.3Accuracy

mPlug-Owl3

4.72422.78740.8558.913Jun 28, 2024Oct 6, 2024Jan 15, 2025Apr 26, 2025Aug 5, 2025Nov 14, 2025Feb 23, 2026
Updated 2d ago

Evaluation Results

MethodLinks
2024.11
74.3
2024.11
71.8
2024.11
70.3
2024.11
70.1
2024.11
67.7
2026.02
63.3
2026.02
63.3
2026.02
63.1
2026.02
63.1
2026.02
62.9
2026.02
62.6
2026.02
62.6
2026.02
62.5
2026.02
62.5
2026.02
62.5
2026.02
62.5
2026.02
62.3
2026.02
61.9
2026.02
61.9
2026.02
61.2
2024.11
60.6
2026.02
60.6
2026.02
60.6
2024.06
59.9
2024.06
59.9
2026.02
59.9
2024.06
59.4
2024.06
59.3
2026.02
59.3
2024.06
59.1
2024.06
59.1
2024.06
59
2026.02
58.5
2026.02
58.5
2024.11
58.3
58.3
2025.12
57.6
2026.02
57.39
2026.02
57.13
2026.02
56.96
2026.02
56.87
2026.02
56.87
2026.02
56.6
2026.02
56.4
2026.02
56.2
2026.02
55.9
2025.12
55.8
2024.11
54.9
2026.02
54.9
2026.02
54.4
2026.02
53.78
2025.12
53.4
2026.02
52.3
2026.02
51.72
2026.02
48.6
2024.11
23.7
2024.11
7.4