
Multimodal Evaluation on MMBench CN

Accuracy: 74.3

mPlug-Owl3

[Chart: accuracy over time, Jun 28, 2024 to Apr 7, 2026]
Updated 11d ago

Evaluation Results

Method    Links
2024.11
74.3
2024.11
71.8
2024.11
70.3
2024.11
70.1
2024.11
67.7
2026.02
63.3
2026.02
63.3
2026.02
63.1
2026.02
63.1
2026.02
62.9
2026.02
62.6
2026.02
62.6
2026.02
62.5
2026.02
62.5
2026.02
62.5
2026.02
62.5
2026.02
62.3
2026.02
61.9
2026.02
61.9
2026.02
61.2
2024.11
60.6
2026.02
60.6
2026.02
60.6
2024.06
59.9
2024.06
59.9
2026.02
59.9
2024.06
59.4
2024.06
59.3
2026.02
59.3
2024.06
59.1
2024.06
59.1
2024.06
59
2026.02
58.5
2026.02
58.5
2024.11
58.3
58.3
2026.04
58.3
2026.03
58.1
2025.12
57.6
2026.02
57.39
2026.03
57.3
2026.02
57.13
2026.03
57
2026.02
56.96
2026.03
56.9
2026.02
56.87
2026.02
56.87
2026.02
56.6
2026.02
56.4
2026.02
56.2
2026.02
55.9
2026.03
55.9
2025.12
55.8
2026.03
55.8
2026.03
55.8
2026.04
55.8
2026.03
55.5
2026.03
55.4
2026.03
55.3
2026.03
55.1
2024.11
54.9
2026.02
54.9
2026.02
54.4
2026.02
53.78
2026.03
53.6
2025.12
53.4
2026.04
52.8
2026.03
52.7
2026.02
52.3
2026.03
52.3
2026.03
52.3
2026.03
52.1
2026.02
51.72
2026.03
50.3
2026.04
50
2026.03
49.6
2026.03
49.1
2026.02
48.6
2026.03
45.9
2026.03
42.1
2026.03
36.6
2024.11
23.7
2024.11
7.4