| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Commonsense QA | Commonsense QA (ARC-E, ARC-C, HellaS, WinoG, BoolQ, OBQA, RTE, CoPa, Race) zero-shot | ARC-Easy Accuracy81.19 | 57 | |
| Commonsense Question Answering | Commonsense QA | BoolQ Accuracy77.4 | 29 | |
| Commonsense Question Answering | Commonsense QA | Reusability Score50.97 | 12 | |
| Question Answering | Commonsense QA | PIQA80.85 | 12 | |
| Question Answering | Commonsense QA | ARC-E Accuracy82.83 | 9 | |
| Question Answering | Commonsense QA QA-avg | QA-avg Score61.67 | 4 | |
| Commonsense Reasoning | Commonsense QA | Average Relative Improvement3.9 | 3 | |
| Commonsense Question Answering | Commonsense QA 8 datasets | Average QA Score66.1 | 3 |