| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Conversational Emotion Recognition | IEMOCAP | Weighted Average F1 Score77.3 | 174 | |
| Emotion Recognition in Conversation | IEMOCAP (test) | Weighted Average F1 Score72.4 | 168 | |
| Multimodal Emotion Recognition | IEMOCAP (test) | Accuracy80.75 | 162 | |
| Emotion Recognition | IEMOCAP | Accuracy78.8 | 151 | |
| Audio-Image-Text Classification | IEMOCAP (test) | Accuracy78.92 | 116 | |
| Multimodal Emotion Recognition | IEMOCAP 6-way | F1 (Avg)71.03 | 106 | |
| Emotion Classification | IEMOCAP (test) | Weighted-F174.02 | 61 | |
| Multimodal Emotion Recognition | IEMOCAP | AUC94.68 | 48 | |
| Emotion Recognition | IEMOCAP 4-class (test) | WAR80.66 | 46 | |
| Multimodal Emotion Recognition in Conversation | IEMOCAP 6-class (test) | Weighted F1 Score (WF1)72.81 | 44 | |
| Emotion Recognition | IEMOCAP (test) | Score (l)0.849 | 36 | |
| Emotion Recognition | IEMOCAPSix (test) | Accuracy62.02 | 35 | |
| Speech Emotion Recognition | IEMOCAP (Leave One Session Out (LOSO)) | WA64.8 | 31 | |
| Speech Emotion Recognition | IEMOCAP (five-fold/ten-fold cross-validation) | WA77.64 | 25 | |
| Speech Emotion Recognition | IEMOCAP | UA75.85 | 22 | |
| Speech Emotion Recognition | IEMOCAP (test) | Accuracy74.57 | 20 | |
| Speech Emotion Recognition | IEMOCAP Speaker-independent 5-fold cross-validation | WA78.47 | 19 | |
| Speech Emotion Recognition | IEMOCAP Speaker-Independent (test) | WA73.01 | 19 | |
| Emotion Recognition in Conversation | IEMOCAP | F1 Score68.03 | 19 | |
| Voice Anonymization | IEMOCAP (test) | UAR71.06 | 18 | |
| Voice Anonymization | IEMOCAP (dev) | UAR69.07 | 18 | |
| Emotion Recognition in Conversation | IEMOCAP 1.0 (test) | Weighted F1 Score68.57 | 17 | |
| Emotion Classification | IEMOCAP 4-way (test) | Weighted F185.9 | 17 | |
| Multimodal Emotion Recognition | IEMOCAP Word Aligned (test) | Happy Accuracy90.7 | 16 | |
| Emotional Text-to-Speech | IEMOCAP | Angry Score88.3 | 15 |