| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Zero-shot Common Sense Reasoning | Common Sense Reasoning | Zero-shot Accuracy72.6 | 137 | |
| Common Sense Reasoning | 7 common sense reasoning datasets | Average Performance71.72 | 61 | |
| Common-sense Reasoning | Common-sense Reasoning Average | Average Accuracy73.44 | 39 | |
| Commonsense Reasoning | Common Sense Reasoning ARC-C, ARC-E, PIQA, StoryCloze | Average Accuracy55.58 | 34 | |
| Common-sense reasoning | Common-sense reasoning (BoolQ, SciQ, PIQA, WinoG., ARC-C, HellaS.) | BoolQ Accuracy88 | 15 | |
| Common Sense Reasoning | Six common sense reasoning tasks | Accuracy69.46 | 15 | |
| Common Sense Reasoning | Common Sense Reasoning ARC, ARC-Easy, HellaSwag, OpenBookQA, PIQA, RACE, WinoGrande | ARC Accuracy47.3 | 13 | |
| Common Sense Reasoning | Common Sense Reasoning (ARC, ARE, HS, OB, PI, RA, WG) | ARC Score39.3 | 12 | |
| Common Sense Reasoning | Common Sense Reasoning | ARC Accuracy47.9 | 10 |