Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Zero-shot Commonsense Reasoning on ARC-Easy, ARC-Challenge, SIQA, PIQA, and WinoGrande

66.1Reasoning Accuracy

LLAMA-2

37.70845.07952.4559.821Mar 12, 2024
Updated 4d ago

Evaluation Results

MethodLinks
2024.03
66.1
2024.03
63.7
2024.03
63.5
2024.03
63.3
2024.03
63.3
2024.03
62.3
2024.03
61.2
2024.03
61
2024.03
56.6
2024.03
38.8