| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Natural Language Processing | 7 NLP Tasks (test) | Average Accuracy88.9 | 20 | |
| Natural Language Processing | 8 NLP Tasks (avg) | Accuracy82.21 | 10 | |
| Natural Language Processing | eleven NLP tasks | Average Accuracy73.1 | 10 | |
| NLP tasks | 11 NLP tasks Symbol-Tuning (held-out) | Accuracy86.4 | 9 | |
| Multiple-Choice Classification | 14 standard NLP tasks suite (held-out) | StoryCloze86.9 | 8 | |
| Natural Language Understanding | 10 NLP tasks (BoolQ, CB, COPA, H-SWAG, MultiRC, RTE, Story Cloze, WiC, Winogrande, WSC) (test) | BoolQ92.1 | 3 |