Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Common Sense Reasoning on PIQA (dev)
Loading...
83.2
Accuracy
Megatron-NLG
71.7496
74.7223
77.695
80.6677
Dec 13, 2021
Mar 12, 2022
Jun 10, 2022
Sep 8, 2022
Dec 6, 2022
Mar 6, 2023
Jun 4, 2023
Accuracy
Updated 3d ago
Evaluation Results
Method
Method
Links
Accuracy
Megatron-NLG
Model Size=530B, Evalu...
2021.12
83.2
GPT-3
Model Size=175B, Evalu...
2021.12
82.3
Gopher
Model Size=280B, Evalu...
2021.12
81.8
GLaM
Model Size=64B/64E, Ev...
2021.12
81.8
GLaM
Model Size=64B/64E, Ev...
2021.12
81.4
GPT-3
Model Size=175B, Evalu...
2021.12
81
GPT-3
Model Size=175B, Evalu...
2021.12
80.5
GLaM
Model Size=64B/64E, Ev...
2021.12
80.4
CKT
Model Scale=large
2023.06
76.07
CALM
Model Scale=large
2023.06
75.11
T5
Model Scale=large
2023.06
72.19
Feedback
Search any
task
Search any
task