| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Text Classification | Emotion | ASR (%)0.808 | 36 | |
| Emotion Classification | Emotion | Accuracy87.68 | 26 | |
| Explanation Faithfulness | Emotion | Delta AF Score5.637 | 24 | |
| Emotion Classification | Emotion | Delta 1 (Emotion)3.27 | 24 | |
| Emotion Classification | Emotion (Out-of-domain) | F1 Score0.4286 | 22 | |
| Text Classification | Emotion | Accuracy91.4 | 18 | |
| Classification | Emotion | ASR100 | 15 | |
| Emotion classification | Emotion | Correlation Coefficient0.6 | 12 | |
| OOD Detection | Emotion | AUROC0.997 | 12 | |
| Open-set selective classification | Emotion (test) | AUAC95.2 | 12 | |
| Emotion Editing | Emotion Easy Task | WER1.4 | 11 | |
| Multi-class Classification | Emotion (EM) | Accuracy54.58 | 11 | |
| Text Classification | Emotion | Total Communication Time ($10^3$ s)2.25 | 9 | |
| Text Classification | Emotion | Accuracy89 | 8 | |
| Multimodal Generation | Emotion | BLEU-L37.65 | 6 | |
| Label Distribution Learning | emotion6 (3 random splits) | KL Divergence0.5111 | 5 | |
| Text Classification | Emotion (test) | Accuracy57.4 | 5 | |
| Time-Series Segmentation | Emotion (test) | F-score0.5833 | 5 | |
| Text Classification | Emotion Plain-text OOD | Accuracy80.66 | 4 | |
| Emotion Editing | Emotion Hard Task | WER2 | 4 | |
| Multi-Task Learning | Emotion | Epochs200 | 4 | |
| Machine Translation (En-Zh) | Emotion | BLEU0.64 | 4 | |
| Machine Translation (En-Fr) | Emotion | BLEU0.47 | 4 | |
| Text Classification | Emotion | ASR36.8 | 4 | |
| Text Translation | Emotion Ali Translate en-zh (test) | BLEU55 | 3 |