
Physical Commonsense Reasoning on PIQA (0-shot and 32-shot)

79.8 Accuracy (0-shot)

OPT-IML 175B

[Chart: Accuracy (0-shot) by date; Dec 22, 2022]
Updated 4d ago

Evaluation Results

Date      Accuracy (0-shot)   Accuracy (32-shot)
2022.12   79.8                80.5
2022.12   79.5                81.6
2022.12   77.5                78.8
2022.12   77.3                69.2
2022.12   72.3                72.6
2022.12   71.8                72.1