Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Complex Reasoning on BIG-bench Hard
Loading...
39.3
Orig Score
FLAN-T5
-1.572
9.039
19.65
30.261
Dec 19, 2022
Orig Score
QA Score
Updated 3d ago
Evaluation Results
Method
Method
Links
Orig Score
QA Score
FLAN-T5
#Examples=14,336,000,...
2022.12
39.3
40
T5-LM on Unnatural Instructions + Instruction Paraphrases
#Examples=240,670, Eva...
2022.12
28.1
29.4
T0++
#Examples=12,492,800,...
2022.12
20.2
13.9
T5-LM on Unnatural Instructions
#Examples=64,000, Eval...
2022.12
16
29.5
T5-LM on Super-Natural Instructions
#Examples=64,000, Eval...
2022.12
10.2
29.7
Tk-Instruct
#Examples=75,417, Eval...
2022.12
5.8
11.8
T5-LM
#Examples=0, Evaluatio...
2022.12
0
0.7
Feedback
Search any
task
Search any
task