Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Multi-modal Reasoning on MMVet (test)

80.8Accuracy

GPT-4o

42.42452.38762.3572.313May 28, 2024Sep 3, 2024Dec 10, 2024Mar 19, 2025Jun 25, 2025Oct 1, 2025Jan 8, 2026
Updated 3d ago

Evaluation Results

MethodLinks
2026.01
80.8-
2024.05
74.24-
2026.01
73.2-
2026.01
71.6440.7
2026.01
71.2114.1
2026.01
70.9118.8
2026.01
70.6137.9
2024.05
70.51-
2026.01
68.7-
2026.01
68.5312.7
2026.01
67.1132.5
2026.01
65.9166.3
2026.01
65.2108.4
2024.05
64.2-
2026.01
64112.7
2026.01
62132.5
2026.01
62117.6
2026.01
61.9-
2026.01
61.3138.8
2024.05
60.2-
2026.01
60-
2025.07
58.5-
2025.07
53.5-
2025.07
53.5-
2025.07
53-
2024.05
51.7-
2024.05
51.3-
2025.07
48-
2025.07
47.7-
2026.01
43.9218.3