Discrete Reasoning on DROP

71.59 Exact Match (EM)

GPT-4

[Chart: score over time, Nov 18, 2023 to Dec 4, 2025]
Updated 4d ago

Evaluation Results

Date      EM      (unlabeled)
2023.11   71.59   -
2023.11   70.88   -
2023.11   69.09   -
2023.11   64.39   -
2023.11   60.26   -
2023.11   59.62   -
2023.11   57.97   -
2023.11   54.11   -
2023.11   53.63   -
2023.11   45.97   -
2023.11   40.73   -
2025.12   10.3    24.3
2025.12   8.5     21.6
2025.12   2.8     14.6
2025.12   2.5     8.6
2025.12   2.4     13.3
2025.12   2.2     9.9
2025.12   0       0
2025.12   0       0