Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Physical Commonsense Reasoning on PIQA (0-shot and 32-shot)

79.8Accuracy (0-shot)

OPT-IML 175B

71.4873.6475.877.96Dec 22, 2022
Updated 1mo ago

Evaluation Results

MethodLinks
2022.12
79.880.5
2022.12
79.581.6
2022.12
77.578.8
2022.12
77.369.2
2022.12
72.372.6
2022.12
71.872.1