| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Zero-shot Text-to-Speech | MLS Pt filtered (test) | WER4.9 | 15 | |
| Zero-shot Text-to-Speech | MLS Fr filtered (test) | WER5.1 | 15 | |
| Zero-shot Text-to-Speech | MLS En filtered (test) | WER0.038 | 15 | |
| Text-to-Speech | Filtered MLS English (test) | SMOS3.96 | 12 | |
| Automatic Speech Recognition | MLS FR (test) | WER3.8 | 10 | |
| Automatic Speech Recognition | MLS ES (test) | WER (%)2.7 | 10 | |
| Automatic Speech Recognition | MLS DE (test) | WER (%)3.1 | 10 | |
| Automatic Speech Recognition | MLS En | WER3.68 | 10 | |
| Automatic Speech Recognition | MLS | NL Score-2.9 | 8 | |
| Zero-shot Text-to-Speech | MLS Pl filtered (test) | WER4 | 8 | |
| Zero-shot Text-to-Speech | MLS Es filtered (test) | WER3.5 | 8 | |
| Zero-shot Text-to-Speech | MLS De filtered (test) | WER4.7 | 8 | |
| Speech Recognition | MLS English (test) | WER4.2 | 6 | |
| Automatic Speech Recognition | MLS | WER (ES)3.3 | 4 | |
| Language Identification | MLS (test) | Accuracy99.9 | 3 |