| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Common Sense Reasoning | HSWAG | Accuracy0.9751 | 52 | |
| Commonsense Reasoning | HSWAG Out-of-Domain (test) | Accuracy42.88 | 8 | |
| Commonsense Reasoning | HSwag | Normalized PLL Score27.8 | 4 | |
| Commonsense Reasoning | HSWAG French (test) | Accuracy33.5 | 4 | |
| Commonsense Reasoning | HSWAG German (test) | Accuracy28.78 | 4 |