| Task Name | Dataset Name | SOTA Result | Trend |
|---|---|---|---|
| Text-to-Speech (Text-Only input) | Emotion TTS (EN) Easy | ACC-I 97.4 | 7 |
| Text-to-Speech (Text-Only input) | Emotion TTS (EN) (Hard) | ACC-I 89.4 | 6 |
| Text-to-Speech (Text and Reference Speech input) | Emotion TTS ZH (Hard) | ACC-I 75.8 | 5 |
| Text-to-Speech (Text and Reference Speech input) | Emotion TTS ZH (Easy) | ACC-I 81.8 | 5 |
| Text-to-Speech (Text and Reference Speech input) | Emotion TTS (EN) Hard | ACC-I 93.4 | 5 |
| Text-to-Speech (Text and Reference Speech input) | Emotion TTS (EN) Easy | ACC-I 93.4 | 5 |
| Text-to-Speech (Text-Only input) | Emotion TTS (ZH) Easy | ACC-I 99.8 | 4 |
| Text-to-Speech (Text-Only input) | Emotion TTS (ZH) (Hard) | ACC-I 98.4 | 3 |