| MLS Pt filtered (test) | Voicebox (Multilingual) | WER 4.9 | | 15 | 1mo ago |
| MLS Fr filtered (test) | Voicebox (Multilingual) | WER 5.1 | | 15 | 1mo ago |
| MLS En filtered (test) | Voicebox (Multilingual) | WER 0.038 | | 15 | 1mo ago |
| YoBind (test) | Fish-Speech | SECS (s) 0.764 | | 12 | 1mo ago |
| LibriTTS (test) | Fish-Speech | SECS 0.765 | | 12 | 1mo ago |
| LibriSpeech-PC clean (test) | BigVGAN | WER 1.78 | | 12 | 1mo ago |
| Singing Voice | Vevo2 | WER 7.66 | | 8 | 1mo ago |
| Expressive Speech | | WER 10.91 | | 8 | 1mo ago |
| MLS Pl filtered (test) | Voicebox (Multilingual) | WER 4 | | 8 | 1mo ago |
| MLS Es filtered (test) | Voicebox (Multilingual) | WER 3.5 | | 8 | 1mo ago |
| MLS De filtered (test) | Voicebox (Multilingual) | WER 4.7 | | 8 | 1mo ago |
| EARS (unseen speakers) | Fun-CosyVoice3-0.5B | WER 1.65 | | 7 | 1mo ago |
| LibriSpeech clean (test) | SpeechEdit | WER 1.3 | | 6 | 1mo ago |
| LibriSpeech SNR = 0dB (test-clean) | OZSpeech | UTMOS 2.58 | | 6 | 1mo ago |
| LibriSpeech SNR = 6dB (test-clean) | OZSpeech | UTMOS 2.9 | | 6 | 1mo ago |
| LibriSpeech SNR = 12dB (test-clean) | F5-TTS | UTMOS 3.09 | | 6 | 1mo ago |
| LibriSpeech SNR = ∞ (test-clean) | F5-TTS | UTMOS 3.76 | | 6 | 1mo ago |
| YoBind | | MOS (Naturalness) 4.31 | | 5 | 1mo ago |
| LibriSpeech clean cross-sentence filtered (test) | VB-En | WER 1.9 | | 5 | 1mo ago |
| Chinese Speech Emotion Prompt | IndexTTS2 | WER 0.0162 | | 4 | 1mo ago |
| English Speech Emotion Prompt | F5TTS | WER 0.0194 | | 4 | 1mo ago |
| LibriSpeech test-clean filtered (continuation) | VB-En | WER 2 | | 3 | 1mo ago |
| CREMA-D unseen opt-out set (-UO) | F5-TTS | SIM-UO 21.7 | | 2 | 1mo ago |