Zero-shot Commonsense Reasoning on ARC-Easy, ARC-Challenge, SIQA, PIQA, and WinoGrande

Top result: LLAMA-2, Reasoning Accuracy 66.1

Updated 4mo ago

Evaluation Results

Method	Reasoning Accuracy
LLAMA-2 2024.03	66.1
BTX 2024.03	63.7
BTX 2024.03	63.5
LLAMA-2 2024.03	63.3
Dense 2024.03	63.3
Sparse upcycling 2024.03	62.3
BTM 2024.03	61.2
BTM 2024.03	61.0
CODELLAMA 2024.03	56.6
LLEMMA 2024.03	38.8