| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Common-sense Reasoning | COPA | Accuracy99.2 | 138 | |
| Question Answering | COPA | Accuracy96 | 59 | |
| Sentence Completion | COPA | Accuracy92.88 | 48 | |
| Commonsense Reasoning | COPA (test) | Accuracy98.67 | 46 | |
| Causal Question Answering | COPA | EM99.3 | 32 | |
| Causal Reasoning | COPA | Accuracy90 | 29 | |
| Multiple Choice | COPA | Accuracy100 | 12 | |
| Causal Reasoning | Copa100 | Accuracy83 | 12 | |
| Multi-class Classification | Copa | Accuracy79.75 | 12 | |
| Commonsense Causal Reasoning | COPA (dev) | Accuracy93 | 7 | |
| Commonsense Reasoning | COPA 2011 | Accuracy79 | 6 | |
| Choice of Plausible Alternatives | COPA 11 languages | Score55.5 | 5 | |
| Commonsense Reasoning | COPA (dev) | Accuracy86 | 3 | |
| Choice of Plausible Alternatives | COPA (dev) | Accuracy65.8 | 3 |