General Visual Reasoning on MMStar (Accuracy)

77.5 Accuracy

Gemini 2.5 Pro

[Chart: accuracy over time, Jan 11, 2026 to Feb 9, 2026]
Updated 1mo ago

Evaluation Results

Date      Accuracy
2026.02   77.5
2026.02   65.27
2026.02   64.7
2026.01   64.7
2026.02   63.67
2026.02   63.53
2026.02   63.47
2026.01   63.2
2026.02   63.07
2026.02   63.07
2026.02   62.73
2026.01   62.67
2026.02   62.6
2026.02   61.53
2026.02   60.8
2026.01   60.33
2026.02   60.27
2026.01   60.27
2026.01   59.7
2026.01   59.13
2026.01   58.73
2026.02   58.33
2026.01   57.93
2026.02   57
2026.02   55.93
2026.02   54.2
2026.02   53.73
2026.01   53.33
2026.01   45.8