| Task Name | Dataset Name | SOTA Result | Trend |
|---|---|---|---|
| Speech Reconstruction | LibriTTS clean (test) | PESQ 4.186 | 50 |
| Speech Reconstruction | LibriTTS (test-other) | UTMOS 3.91 | 44 |
| Audio Generation | LibriTTS (dev) | M-STFT 1.3647 | 18 |
| Speech Synthesis | LibriTTS (test) | MOS 4.9134 | 17 |
| Text-to-Speech | LibriTTS (test) | MOS 4.54 | 16 |
| Text-to-Speech | LibriTTS clean (test) | WER 0.018 | 15 |
| Text-to-Speech | LibriTTS zero-shot | UTMOS 4.3026 | 14 |
| Waveform Generation | LibriTTS 24,000 Hz (test) | UTMOS 3.7229 | 13 |
| Waveform Generation | LibriTTS (dev) | M-STFT 1.2129 | 12 |
| Voice Conversion | LibriTTS (test-clean) | WER 2.04 | 11 |
| Speech Synthesis | LibriTTS 24,000 Hz (test) | MOS 4.28 | 11 |
| Waveform Generation | LibriTTS-R clean (test) | Speech BERT Score 100 | 10 |
| Audio Reconstruction | LibriTTS clean (test) | Mel Distance 0.3442 | 10 |
| Vocoding | LibriTTS (dev-other) | MAE 0.0986 | 10 |
| Neural Vocoding | LibriTTS clean (dev) | MAE 0.0931 | 10 |
| Speech Synthesis | LibriTTS (ID) | PESQ 4.5 | 9 |
| Audio Watermarking | LibriTTS | PESQ 4.3289 | 8 |
| Generative Speech Watermarking | LibriTTS OOD (test) | STOI 0.9789 | 8 |
| Neural Vocoding | LibriTTS | UTMOS 4.058 | 8 |
| Speech Resynthesis | LibriTTS (test-clean) | WER 3.32 | 7 |
| Speech Reconstruction | LibriTTS (test) | PESQ 4.16 | 7 |
| Universal Neural Vocoding | LibriTTS clean and other (dev) | M-STFT 0.7997 | 6 |
| Voice Conversion | LibriTTS unseen-to-unseen (test-clean) | MOS 4.27 | 6 |
| Voice Conversion | LibriTTS to VCTK (unseen-to-seen) (test-clean) | MOS 4.29 | 6 |
| Speech Dereverberation | LibriTTS (clean part) + openSLR26/28 RIR (test) | PESQ 2.87 | 5 |