Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Common Sense Reasoning on ARC, BoolQ, RTE, Winogrande, and TruthfulQA

34ARC Challenge Accuracy

OpenLLaMA (V1)

22.5625.5328.531.47Dec 28, 2023
Updated 3d ago

Evaluation Results

MethodLinks
2023.12
34696858622235
2023.12
32686752632133
2023.12
32686159632336
2023.12
28626252552541
2023.12
26616352572339
2023.12
26615353592135
2023.12
24575651592439
2023.12
23575955572339