| Task Name | Dataset Name | SOTA Result | Trend |
|---|---|---|---|
| Commonsense Reasoning | Commonsense | RCC 74.6 | 29 |
| Commonsense Reasoning | Commonsense 8 Sub-Tasks | Accuracy (8 Sub-Tasks) 61.4 | 23 |
| Commonsense Reasoning | Commonsense170k (test) | BoolQ Accuracy 75.4 | 22 |
| Commonsense Reasoning | Commonsense Gender (test) | Accuracy 20 | 5 |
| Commonsense Reasoning | Commonsense Race (test) | Correctness Rate 40.4 | 5 |
| Question Answering | CommonSense | Perplexity 25.969 | 3 |