Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Visual Reasoning on V*Bench

95.7Accuracy

ChatGPT-o3

46.359.12571.9584.775Jun 13, 2024Sep 22, 2024Jan 2, 2025Apr 14, 2025Jul 24, 2025Nov 3, 2025Feb 13, 2026
Updated 3d ago

Evaluation Results

MethodLinks
2026.02
95.7-
2026.02
91.2-
2026.02
91.1-
2026.02
90.1-
2026.02
89.5-
2026.02
88.7-
2026.02
87-
2026.02
8755.4
2026.02
8767.5
2026.02
86.4-
2026.02
86.1-
2026.02
86.1-
2026.02
86.155
2026.02
86.158.2
2026.02
85.260
2026.02
85.262.5
2026.02
84.3-
2026.02
83.8-
2026.02
83.5-
2026.02
83.25-
2025.11
83.25-
2026.02
82.2-
2025.11
81.7-
2025.11
81.68-
2026.02
81.15-
2025.11
80.6-
2025.11
80.6-
2024.06
80.3-
2026.02
80.1-
2026.02
80.1-
2025.11
80.1-
2026.02
79.58-
2026.02
79.06-
2025.11
79.05-
2026.02
78.01-
2026.02
76.96-
2024.06
75.4-
2026.02
75.39-
2025.11
75.39-
2025.11
73.9-
2026.02
73.82-
2026.02
72.77-
2026.02
72.25-
2026.02
72.2-
2024.06
71-
2026.02
70.16-
2026.02
69.7-
2025.11
69.7-
2025.11
68.58-
2024.06
66-
2026.02
60.73-
2026.02
57.07-
2025.11
56.54-
55-
2026.02
52.88-
2024.06
52.5-
48.7-
2024.06
48.2-