| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Dialogue Commonsense Reasoning | CICERO v2 (test) | Accuracy93.25 | 4 | |
| Dialogue Commonsense Reasoning | CICERO v1 (test) | Accuracy88.04 | 4 | |
| Multiple Choice Question | CICERO v2 | Macro F188.63 | 2 | |
| Multiple Choice Question | CICERO | Macro F170.66 | 2 |