Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Commonsense Reasoning on OBQA (dev)
Loading...
66.7
Accuracy
CKT
61.188
62.619
64.05
65.481
Jun 4, 2023
Accuracy
Updated 4d ago
Evaluation Results
Method
Method
Links
Accuracy
CKT
Model Scale=large
2023.06
66.7
CALM
Model Scale=large
2023.06
66
T5
Model Scale=large
2023.06
61.4
Feedback
Search any
task
Search any
task