Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Question Answering on OpenBook-QA (Accuracy and Performance Gain)

91.6Accuracy

Mistral Small 24B Inst 2501

42.09654.94867.880.652Jan 30, 2026
Updated 4d ago

Evaluation Results

MethodLinks
2026.01
91.6-0.2
2026.01
91.2-0.6
2026.01
85.60.4
2026.01
85.60.4
2026.01
82.41.4
2026.01
81.80.8
2026.01
76.61
2026.01
76.61
2026.01
76.40.2
2026.01
76.20
2026.01
74.81.2
2026.01
74.20.6
2026.01
73.40.2
2026.01
73-0.2
2026.01
704.2
2026.01
68.80.6
2026.01
68.40.2
2026.01
66.20.4
2026.01
65.40.6
2026.01
64.80
2026.01
59.80
2026.01
59.6-0.2
2026.01
47.43.8
2026.01
440.4