Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

General Visual Reasoning on MMStar (Accuracy)

77.5Accuracy

Gemini 2.5 Pro

44.53253.09161.6570.209Jan 11, 2026Jan 15, 2026Jan 20, 2026Jan 25, 2026Jan 30, 2026Feb 4, 2026Feb 9, 2026
Updated 4d ago

Evaluation Results

MethodLinks
2026.02
77.5
2026.02
65.27
2026.02
64.7
2026.01
64.7
2026.02
63.67
2026.02
63.53
2026.02
63.47
2026.01
63.2
2026.02
63.07
2026.02
63.07
2026.02
62.73
2026.01
62.67
2026.02
62.6
2026.02
61.53
2026.02
60.8
2026.01
60.33
2026.02
60.27
2026.01
60.27
2026.01
59.7
2026.01
59.13
2026.01
58.73
2026.02
58.33
2026.01
57.93
2026.02
57
2026.02
55.93
2026.02
54.2
2026.02
53.73
2026.01
53.33
2026.01
45.8