Reasoning on ARC Challenge

96.7 Accuracy

GPT-4o

[Chart: accuracy over time, Nov 18, 2023 to Feb 8, 2025]
Updated 3d ago

Evaluation Results

Method | Links
2024.10
96.7-
96.4-
2023.11
93.26-
2024.10
91-
2023.11
84.73-
83.4-
2023.11
83.36-
2023.11
79.95-
2023.11
78.41-
2023.11
74.83-
2023.11
74.74-
2023.11
71.93-
70.7-
69.6-
2024.07
68.9-
68.8-
2023.11
67.66-
65.9-
2023.11
61.18-
2024.07
61.1-
2023.11
50.43-
2024.10
50.3-
2024.10
49.7-
2024.10
49.6-
2024.10
49-
2024.10
48.8-
2024.10
48.7-
2024.07
48.5-
2024.07
43.9-
2024.07
37.9-
2024.07
31.5-
2025.02
26.6-
2025.02
26.5-
2025.02
26.4-
2025.02
26-
2025.02
24.9-
2025.02
24.9-
2025.02
24.8-
2025.02
24.1-
2025.02
23.6-
2025.02
22.6-
2025.02
21.8-
2025.02
21.3-
2025.02
20.6-
2025.02
17.9-
2026.02
-43.1
2026.02
-45.4
2026.02
-46.3
2026.02
-46.7
2026.02
-44.6
2026.02
-46.7
2026.02
-46.9
2026.02
-76.1
2026.02
-74.1
2026.02
-74.4
2026.02
-77.1
2026.02
-76.7
2026.02
-78.2
2026.02
-79.7