| Task Name | Dataset Name | Metric | SOTA Result | Trend |
|---|---|---|---|---|
| Symbolic Reasoning | Date Understanding (DU) | Accuracy | 87.2 | 10 |
| Multiple-Choice Reasoning | Date Understanding (test) | Accuracy | 78.2 | 8 |
| Symbolic Reasoning | Date Understanding (DU) (test) | Accuracy | 67.52 | 4 |
| Commonsense Reasoning | Date Understanding | Accuracy | 16.3 | 3 |
| Logical Reasoning | Date Understanding | Accuracy (format-specific prompt) | 67.5 | 2 |