| Task Name | Dataset Name | SOTA Result | Trend |
|---|---|---|---|
| Question Answering | QASC | Score 89.6 | 36 |
| Multiple Choice Question Answering | QASC | Accuracy 100 | 22 |
| Multiple Choice Question Answering | QASC (test) | Accuracy 78.5 | 16 |
| Commonsense Reasoning | QASC (dev) | Accuracy 84.02 | 14 |
| Question Answering | QASC | F1 14.73 | 10 |
| Multiple Choice Question Answering | QASC (dev) | Accuracy 67.61 | 10 |
| Logical Refinement of Natural Language Explanations | QASC | Initial Score 17 | 8 |
| Commonsense Question Answering | QASC (dev) | Accuracy 83.7 | 7 |
| Commonsense Reasoning | QASC (test) | Accuracy 90.06 | 6 |
| Commonsense Question Answering | Scientific Commonsense (QASC) 1.0 (test) | Accuracy 53.04 | 5 |
| Question Answering | QASC MRQA few-shot | F1 Score 99.1 | 5 |
| Commonsense Question Answering | QASC | Accuracy 72.8 | 4 |