| Dataset Name | SOTA Method | Metric | Trend | ||
|---|---|---|---|---|---|
| ESD MED-TTS (Chinese) | SparkTTS | CER0.0311 | 9 | 4d ago | |
| ESD (English) | CosyVoice2 | WER1.411 | 9 | 4d ago | |
| IEMOCAP (Mid-mismatch) | CoCoEmo | E-SIM0.913 | 8 | 4d ago | |
| IEMOCAP Low-mismatch | CoCoEmo | E-SIM0.908 | 8 | 4d ago | |
| IEMOCAP (high-mismatch set) | CoCoEmo | E-SIM0.874 | 8 | 4d ago | |
| Mandarin emotional speech dataset | RRPO | E-MOS3.78 | 4 | 4d ago | |
| Spanish emotional set (test-es-emo) | IndexTTS 2.5 | SS Score0.848 | 2 | 4d ago | |
| Japanese emotional (test) | CosyVoice 3 | SS0.873 | 2 | 4d ago |