
Question Answering on OpenBookQA (Normalized Accuracy)

45 Normalized Accuracy

Llama 3-8B

[Chart: Normalized Accuracy over time, Dec 13, 2024 to Dec 16, 2025]
Updated 3d ago

Evaluation Results

Date     Normalized Accuracy
2024.12  45
2024.12  44.8
2025.02  33.6
2025.02  33.2
2025.02  33
2025.02  32.8
2025.02  32.8
2025.02  32.4
2025.02  32.2
2025.12  31.6
2025.12  30.8
2025.12  30.8
2025.02  30.4
2025.02  30.4
2025.02  30.2
2025.12  30.18
2025.12  30
2025.02  30
2025.12  29.93
2025.12  29.8
2025.12  29.4
2025.12  29.22
2025.12  29.2
2025.12  29
2025.12  28.4
2025.12  28
2025.12  27
2025.12  27
2025.12  26.6
2025.12  17.6
2025.12  14.4
2025.12  13.6
2025.12  9.9
2025.12  8.5
2025.12  -0.5