| Dataset Name | SOTA Method | Metric | Trend | ||
|---|---|---|---|---|---|
| MLS Pt filtered (test) | Voicebox (Multilingual) | WER4.9 | 15 | 4d ago | |
| MLS Fr filtered (test) | Voicebox (Multilingual) | WER5.1 | 15 | 4d ago | |
| MLS En filtered (test) | Voicebox (Multilingual) | WER0.038 | 15 | 4d ago | |
| LibriSpeech-PC clean (test) | BigVGAN | WER1.78 | 12 | 4d ago | |
| MLS Pl filtered (test) | Voicebox (Multilingual) | WER4 | 8 | 4d ago | |
| MLS Es filtered (test) | Voicebox (Multilingual) | WER3.5 | 8 | 4d ago | |
| MLS De filtered (test) | Voicebox (Multilingual) | WER4.7 | 8 | 4d ago | |
| LibriSpeech clean (test) | SpeechEdit | WER1.3 | 6 | 4d ago | |
| LibriSpeech SNR = 0dB (test-clean) | OZSpeech | UTMOS2.58 | 6 | 4d ago | |
| LibriSpeech SNR = 6dB (test-clean) | OZSpeech | UTMOS2.9 | 6 | 4d ago | |
| LibriSpeech SNR = 12dB (test-clean) | F5-TTS | UTMOS3.09 | 6 | 4d ago | |
| LibriSpeech SNR = ∞ (test-clean) | F5-TTS | UTMOS3.76 | 6 | 4d ago | |
| LibriSpeech clean cross-sentence filtered (test) | VB-En | WER1.9 | 5 | 4d ago | |
| Chinese Speech Emotion Prompt | IndexTTS2 | WER0.0162 | 4 | 4d ago | |
| English Speech Emotion Prompt | F5TTS | WER0.0194 | 4 | 4d ago | |
| LibriSpeech test-clean filtered (continuation) | VB-En | WER2 | 3 | 4d ago | |
| CREMA-D unseen opt-out set (-UO) | F5-TTS | SIM-UO21.7 | 2 | 4d ago |