Computer Vision Reasoning on CV-Bench

Accuracy: 85.7

GAMSI_S1+S2

[Chart: accuracy over time, May 8, 2026 to May 25, 2026]
Updated 8d ago

Evaluation Results

Date      Accuracy
2026.05   85.7
2026.05   85.5
2026.05   85.4
2026.05   84.6
2026.05   84.5
2026.05   84
2026.05   83.9
2026.05   83.4
2026.05   83.4
-         82.4
2026.05   81.1
2026.05   76.9
-         75.3
2026.05   75.2
2026.05   74.9
2026.05   73.7
2026.05   73.6
-         68.4
-         68.1
-         66.3
2026.05   66
2026.05   64.9
-         63
-         56.7
2026.05   54.7
2026.05   43.5