Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Commonsense Reasoning on Rainbow large (val)
Loading...
92.5
aNLI
CompassMTL w/ Tailor
78.98
82.49
86
89.51
Oct 12, 2022
aNLI
CosmosQA
HellaSwag
PIQA
SocialIQA
Winogrande
Average Score
Updated 4d ago
Evaluation Results
Method
Method
Links
aNLI
CosmosQA
HellaSwag
PIQA
SocialIQA
Winogrande
Average Score
CompassMTL w/ Tailor
Arch.=Enc only, Tasks=...
2022.10
92.5
88.8
96.1
88.3
82.2
90.5
89.7
CompassMTL
Arch.=Enc only, Tasks=...
2022.10
91.7
87.8
95.6
87.3
81.7
89.6
89
ExDeBERTa
Arch.=Enc only, Tasks=...
2022.10
87.9
85.3
83.6
85.5
79.6
87
84.8
ExT5
Arch.=Enc-Dec, Tasks=1...
2022.10
82.3
85.9
89
85
79.7
82.5
84.1
UNICORN
Arch.=Enc-Dec, Tasks=6...
2022.10
79.5
83.2
83
82.2
75.5
78.7
80.4
Feedback
Search any
task
Search any
task