Zero-shot Evaluation on GPT-3 Evaluation Suite (LAMBADA, TriviaQA, WebQs, PIQA, RACE-h, BoolQ)
[Chart: Overall Accuracy by date. Highlighted result: 44.4, GPT-3 1.3B (Original), Aug 13, 2021. Selectable metrics: Overall, LAMBADA, TriviaQA, WebQs, PIQA, RACE-h, and BoolQ Accuracy.]
Updated 4d ago
Evaluation Results
All metric columns are accuracies.

| Method | Method (details) | Links | Overall | LAMBADA | TriviaQA | WebQs | PIQA | RACE-h | BoolQ |
|---|---|---|---|---|---|---|---|---|---|
| GPT-3 1.3B (Original) | Case=Original [6], Mod... | 2021.08 | 44.4 | 63.6 | 19.7 | 4.63 | 75.1 | 40.9 | 62.4 |
| GPT-3 1.3B (SLW) | Case=8x Bsz, Model siz... | 2021.08 | 41.9 | 65 | 11.3 | 2.36 | 73.8 | 37.1 | 61.8 |
| GPT-3 1.3B (Baseline repro) | Case=repro, Model size... | 2021.08 | 41.6 | 63.7 | 10.1 | 3.25 | 73.4 | 35.6 | 63.4 |
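
The page does not say how Overall Accuracy is derived. The reported values are, however, consistent with an unweighted mean of the six task accuracies rounded to one decimal place. The sketch below checks that assumption against the results above; `overall_accuracy` and the dictionary names are hypothetical helpers for illustration, not part of the leaderboard.

```python
# Per-task accuracies copied from the results above, in the order
# LAMBADA, TriviaQA, WebQs, PIQA, RACE-h, BoolQ.
TASK_SCORES = {
    "GPT-3 1.3B (Original)": [63.6, 19.7, 4.63, 75.1, 40.9, 62.4],
    "GPT-3 1.3B (SLW)": [65.0, 11.3, 2.36, 73.8, 37.1, 61.8],
    "GPT-3 1.3B (Baseline repro)": [63.7, 10.1, 3.25, 73.4, 35.6, 63.4],
}

# Overall Accuracy values as reported on the page.
REPORTED_OVERALL = {
    "GPT-3 1.3B (Original)": 44.4,
    "GPT-3 1.3B (SLW)": 41.9,
    "GPT-3 1.3B (Baseline repro)": 41.6,
}


def overall_accuracy(scores):
    """Unweighted mean of the task accuracies, rounded to one decimal.

    Assumption: this is how the leaderboard aggregates; the page itself
    does not state the formula.
    """
    return round(sum(scores) / len(scores), 1)


if __name__ == "__main__":
    for method, scores in TASK_SCORES.items():
        computed = overall_accuracy(scores)
        matches = computed == REPORTED_OVERALL[method]
        print(f"{method}: computed={computed}, "
              f"reported={REPORTED_OVERALL[method]}, match={matches}")
```

Under this reading every task carries equal weight, so the Original model's lead in Overall Accuracy comes mainly from TriviaQA (19.7 against 11.3 and 10.1) and RACE-h, while SLW is ahead on LAMBADA and the baseline reproduction is ahead on BoolQ.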