| VCTK | | WER0 | | 21 | 1mo ago |
| LibriSpeech (test-clean) | GT (Vocoder) | WER2.87 | | 13 | 1mo ago |
| ELHE HE portion (160 unmodified utterances) | | CER2.9 | | 11 | 1mo ago |
| LibriTTS (test-clean) | | WER2.04 | | 11 | 1mo ago |
| Seed-TTS zh (test) | | WER1.33 | | 9 | 4d ago |
| VCTK (test) | | nMOS4.26 | | 9 | 1mo ago |
| LibriSpeech (test-clean and test-other) | SCF | WER2.18 | | 8 | 1mo ago |
| SeedTTS VC Chinese (test) | | WER1.25 | | 8 | 1mo ago |
| SeedTTS VC English (test) | | WER2.15 | | 8 | 1mo ago |
| Seed-TTS en (test) | | WER1.96 | | 7 | 4d ago |
| LibriSpeech (test-clean source, test-other target) | | MOS3.8 | | 7 | 1mo ago |
| Seed-TTS English | CosyVoice-VC | SECS0.9129 | | 7 | 1mo ago |
| Elliot Miller target speaker | | WER3.22 | | 7 | 1mo ago |
| LJSpeech target speaker | | WER3.22 | | 7 | 1mo ago |
| Expresso OOD | | F0 Correlation0.543 | | 6 | 1mo ago |
| TIMIT OOD | | F0 Correlation0.484 | | 6 | 1mo ago |
| Voice Conversion (VC) Benchmark | | WER3.25 | | 6 | 1mo ago |
| LibriTTS unseen-to-unseen (test-clean) | | MOS4.27 | | 6 | 1mo ago |
| LibriTTS to VCTK (unseen-to-seen) (test-clean) | | MOS4.29 | | 6 | 1mo ago |
| VCTK seen-to-seen (test) | | MOS4.32 | | 6 | 1mo ago |
| ZeroSpeech Indonesian 2019 (test) | Chen and Hain | CER15 | | 6 | 1mo ago |
| ZeroSpeech English 2019 (test) | Chen and Hain | CER18 | | 6 | 1mo ago |
| CMU Arctic clb to slt | SpeechT5 | MCD5.87 | | 5 | 1mo ago |
| CMU Arctic bdl to slt | SpeechT5 | MCD5.93 | | 5 | 1mo ago |
| Voice Conversion (VC) Zero-shot | SeedVC | UTMOS4.04 | | 4 | 1mo ago |