| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| ASR Error Correction | Common Voice Frisian (test) | WER8.9 | 27 | |
| Automatic Speech Recognition | Common Voice | WER7.76 | 22 | |
| Audio Reconstruction | Common Voice | CVU0.9888 | 21 | |
| Speech Recognition | Common Voice | WER9 | 17 | |
| Transcript Alignment | Common Voice English 8 (test) | Character GLE58.9 | 16 | |
| Speech Recognition | Common Voice EN | WER6.76 | 16 | |
| Automatic Speech Recognition | Common Voice en 15 | WER4.83 | 16 | |
| Automatic Speech Recognition | Common Voice English Accents Exp. 1 | WER (US Accent)15.4 | 15 | |
| Speech quality evaluation | Common Voice 17 | Quality Score (NL)2.55 | 14 | |
| Multilingual Automatic Speech Recognition | Common Voice Exp. 3 (test) | WER (US)15.4 | 13 | |
| Automatic Speech Recognition | Common Voice Arabic (test) | WER38.21 | 12 | |
| Automatic Speech Recognition | Common Voice Mandarin (test) | CER17.82 | 12 | |
| Automatic Speech Recognition | Common Voice Spanish (test) | WER24.58 | 12 | |
| Automatic Speech Recognition | Common Voice tw | CER1.61 | 10 | |
| Speech Classification | Common Voice (test) | Macro-F148.68 | 10 | |
| Automatic Speech Recognition | Common Voice 15 | WER6.1 | 10 | |
| Automatic Speech Recognition | Common Voice Tamil | WER31.78 | 9 | |
| Automatic Speech Recognition | Common Voice Mandarin | CER4.91 | 9 | |
| Text-to-Speech | Common Voice en 15 | WER10.8 | 9 | |
| Automatic Speech Recognition | Common Voice (test) | WER8.5 | 9 | |
| Automatic Speech Recognition | Common Voice Unseen Languages | WER43.2 | 8 | |
| Continual Learning for Automatic Speech Recognition | Common Voice Exp. 1 (test) | WER (US)15 | 7 | |
| Voice Cloning | Common Voice English | SIM Score0.81 | 7 | |
| Automatic Speech Recognition | Common Voice Singaporean | WER4.9 | 7 | |
| Automatic Speech Recognition | Common Voice Indian | WER0.057 | 7 |