| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Commonsense Reasoning | SIQA | Accuracy89.85 | 96 | |
| Social Interaction Question Answering | SIQA | Accuracy86.9 | 85 | |
| Reasoning | SIQA | Accuracy83.2 | 44 | |
| Social Commonsense Reasoning | SIQA | Accuracy85.24 | 32 | |
| Social Commonsense Reasoning | SIQA (test) | Accuracy83.3 | 20 | |
| Reasoning | SIQA | Accuracy Improvement2.12 | 12 | |
| Reasoning | SIQA (val) | Accuracy35.47 | 9 | |
| Commonsense Reasoning | SIQA (test) | Accuracy40.28 | 6 | |
| Social Reasoning | SIQA | Performance (%)15.2 | 6 | |
| Social Interaction Question Answering | SIQA | Normalized PLL Score15.4 | 4 |