Visual Reasoning on VisualProbe Hard

Accuracy: 0.434

Deepconf

[Chart: Accuracy, Feb 13, 2026]
Updated 3d ago

Evaluation Results

Date      Accuracy  (unlabeled)
2026.02   0.434     14.7
2026.02   0.425     -
2026.02   0.425     22.5
2026.02   0.425     6.7
2026.02   0.425     16.2
2026.02   0.396     9.2
2026.02   0.377     12
2026.02   0.351     -
2026.02   0.349     -
2026.02   0.302     -
2026.02   0.302     29.7
2026.02   0.302     8.6
2026.02   0.302     44.6
2026.02   0.292     40.3
2026.02   0.283     -
2026.02   0.274     15
2026.02   0.274     14.5
2026.02   0.112     -