| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Binary Classification | ESNLI | Accuracy99 | 18 | |
| Text to Audio | ESNLI (test) | Accuracy79.6 | 6 | |
| Text to Text | ESNLI (test) | Accuracy (ESNLI Test)80.6 | 6 | |
| Audio to Audio | ESNLI (test) | Accuracy77.1 | 6 | |
| Audio to Text | ESNLI (test) | Accuracy77.4 | 6 | |
| Explanation Generation | eSNLI complete (test) | BERTScore88.84 | 6 | |
| Generative Task | esnli | BS Score60.02 | 4 | |
| Reasoning trace quality evaluation | eSNLI | Grammar Score5.9 | 2 |