| Task Name | Dataset Name | Metric | SOTA Result | Trend |
|---|---|---|---|---|
| Commonsense Reasoning | Commonsense | RCC | 74.6 | 29 |
| Commonsense Reasoning | Commonsense 8 Sub-Tasks | Accuracy (8 Sub-Tasks) | 61.4 | 26 |
| Commonsense Reasoning | Commonsense170k (test) | BoolQ Accuracy | 75.4 | 22 |
| Morality Evaluation | Commonsense | Mean Improvement | 10 | 9 |
| Commonsense Reasoning | Commonsense-15K (test) | ARC-Challenge Accuracy | 36.11 | 7 |
| Commonsense Reasoning | Commonsense-15K | ARC-Challenge Accuracy | 53.33 | 5 |
| Commonsense Reasoning | Commonsense Gender (test) | Accuracy | 20 | 5 |
| Commonsense Reasoning | Commonsense Race (test) | Correctness Rate | 40.4 | 5 |
| Question Answering | CommonSense | Perplexity | 25.969 | 3 |