Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Zero-shot Reasoning on OpenBookQA

44Accuracy

Llama2-7B

12.17620.43828.736.962Feb 16, 2026
Updated 4d ago

Evaluation Results

MethodLinks
2026.02
44
2026.02
32.8
2026.02
30.4
2026.02
29.2
2026.02
26
2026.02
25.2
2026.02
13.4