Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Visual Question Answering on MMStar

82.96Accuracy

Gemini 3-Pro

28.328842.511956.69570.8781Jun 20, 2024Oct 2, 2024Jan 15, 2025Apr 30, 2025Aug 13, 2025Nov 26, 2025Mar 11, 2026
Updated 1mo ago

Evaluation Results

MethodLinks
82.96
2026.02
82.1
2026.01
79.4
77.5
2026.02
76.88
2026.01
76.5
2026.02
75.54
2026.01
75.5
2026.01
75.3
2026.01
75.2
2026.01
74.1
2026.01
73.3
2026.01
72.8
2026.02
71.2
70.3
2026.01
69.3
2026.01
69
2026.01
67.7
2026.02
64.8
2026.02
64.7
2024.07
64.1
61.9
2024.06
61.6
60.3
2024.07
59.9
2024.06
57.1
2024.07
57.1
2024.07
57.1
2024.06
56.2
2024.06
51.6
2024.06
49.7
2026.02
49.03
2026.02
48.79
2026.02
48.18
2026.02
46.55
2024.06
45.9
2024.06
43.7
2024.06
43.3
2024.06
42
2024.06
41.9
2026.03
41.67
2026.03
40.73
2024.06
40.7
2024.06
40.5
2024.06
40.5
2024.06
40.4
2026.03
40.2
2024.06
39.1
2026.03
38.8
2024.06
38.7
2026.03
38.53
2026.03
38.47
2024.06
38.4
2024.06
38.3
2024.06
37.6
2026.02
36.14
2026.02
35.54
2024.06
34.8
2024.06
34.5
2024.06
34.3
2026.02
33.97
2024.06
33.1
2026.02
30.43