| Task Name | Dataset Name | SOTA Result | Trend |
|---|---|---|---|
| Text-to-Speech | Seed-TTS en (test) | WER 1.24 | 50 |
| Text-to-Speech | Seed-TTS zh (test) | WER 0.0118 | 47 |
| Text-to-Speech | Seed-TTS (eval) | WER 1.85 | 39 |
| Text-to-Speech | Seed-TTS Seed-ZH (Evaluation) | CER 0.89 | 16 |
| Text-to-Speech | Seed-TTS Seed-ZH (test) | WER 1.02 | 11 |
| Text-to-Speech | Seed-TTS Seed-EN (test) | WER 0.0147 | 11 |
| Text-to-Speech | Seed-TTS 24 kHz (test-zh) | SIM-o 0.762 | 11 |
| Text-to-Speech | Seed-TTS en 24 kHz (test) | SIM-o 0.734 | 11 |
| Voice Cloning | SEED-TTS-Eval ZH (test) | CER 1.03 | 8 |
| Voice Cloning | SEED-TTS EN (test) | WER 1.83 | 8 |
| Voice Conversion | Seed-TTS English | SECS 0.9129 | 7 |
| Text-to-Speech | Seed-TTS English (test) | WER 2.14 | 5 |
| TTS Quality Improvement | Seed-TTS MaskGCT | WVMOS 4.172 | 5 |
| TTS Quality Improvement | Seed-TTS MoonCast | WVMOS 4.098 | 5 |
| Automatic Speech Recognition | Seed-TTS EN | WER 1.47 | 4 |
| Automatic Speech Recognition | Seed-TTS ZH | WER 0.013 | 4 |
| Text-to-Speech | Seed-TTS ZH | WER 1.21 | 3 |
| Text-to-Speech | Seed-TTS EN | WER 3.1 | 3 |
| Text-to-Speech | Seed-TTS-Eval ZH | UTMOS 2.92 | 3 |