| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Commonsense Reasoning and Short-Context Language Understanding | Commonsense Reasoning and Short-Context Language Understanding Suite (PIQA, HellaSwag, WinoGrande, ARC-Easy, ARC-Challenge, SIQA, BoolQ, LAMBADA) zero-shot | PIQA Accuracy (Zero-shot)73.3 | 2 |