| Task Name | Dataset Name | SOTA Result | Trend |
|---|---|---|---|
| Text-to-Speech | SeedTTS en (test) | WER 1.521 | 21 |
| Speech Reconstruction | SeedTTS en (test) | WER 0.0214 | 18 |
| Text-to-Speech | SeedTTS English (test) | WER 1.69 | 12 |
| Zero-shot Voice Imitation | SeedTTS vc-en (test) | UTMOS 3.31 | 10 |
| Voice Conversion | SeedTTS VC English (test) | WER 2.15 | 8 |
| Neural Audio Compression | SeedTTS English (test) | MOS 4.126 | 8 |
| Neural Audio Compression | SeedTTS Chinese (test) | MOS 4.221 | 8 |
| Text-to-Speech | SeedTTS en | Error Rate 1.39 | 5 |
| Text-to-Speech | SeedTTS (test) | NMOS 3.72 | 4 |
| Voice Imitation | SeedTTS vc-en (test) | N-MOS 4.71 | 3 |