Visual Logical Reasoning on LogicVista

52.8 Accuracy

GPT4-o

[Chart: accuracy over time, Jan 1, 2026 to Feb 11, 2026]
Updated 4d ago

Evaluation Results

Date      Accuracy  (unlabeled)  (unlabeled)
2026.01   52.8      -            -
2026.01   52.3      -            -
2026.02   48.44     -            -
2026.02   48.21     -            -
2026.01   48.2      -            -
2026.01   47.9      -            -
2026.01   47.4      -            -
2026.02   46.65     -            -
2026.01   46.3      -            -
2026.02   46.21     -            -
2026.01   45.8      -            -
2026.01   45.8      -            -
2026.01   45.5      -            -
2026.01   45.1      -            -
2026.01   45.1      -            -
2026.02   45.09     -            -
2026.01   43.2      -            -
2026.02   43.08     -            -
2026.02   41.29     -            -
2026.01   40.9      -            -
2026.02   40.62     -            -
2026.01   39.4      -            -
2026.02   39.29     -            -
2026.01   37.7      -            -
2026.01   37.4      -            -
2026.02   37.28     -            -
2026.01   37.1      -            -
2026.01   32.4      -            -
2026.01   -         25.2         19
2026.01   -         20.6         25.3
2026.01   -         25.2         19
2026.01   -         20.4         27.9
2026.01   -         29           17.7
2026.01   -         34.6         22.8
2026.01   -         33.6         19
2026.01   -         28           31.6
2026.01   -         21.5         25.3