| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Speaker-Attributed Automatic Speech Recognition | AISHELL-4 (test) | cpCER0.2004 | 33 | |
| Automatic Speech Recognition | AISHELL-1 | WER0.6 | 31 | |
| Speech Recognition | AISHELL-1 (dev) | WER1.2 | 28 | |
| Automatic Speech Recognition | AISHELL (test) | CER1.13 | 26 | |
| Speaker Diarization | AISHELL-4 | DER (%)1.6 | 20 | |
| Automatic Speech Recognition | AISHELL-2 (ios) | CER2.29 | 16 | |
| Automatic Speech Recognition | AISHELL D 2021 (test) | CER1.66 | 15 | |
| Automatic Speech Recognition | AISHELL C 2021 (Eval) | CER1.51 | 15 | |
| Automatic Speech Recognition | AISHELL Eval A 2021 | CER3.12 | 15 | |
| Automatic Speech Recognition | AISHELL-2 ios (dev) | CER2.07 | 15 | |
| Speech Synthesis | AISHELL3 Mandarin | UTMOS2.7 | 14 | |
| Automatic Speech Recognition | AISHELL-1 | Word Error Rate (WER)0.76 | 10 | |
| Automatic Speech Recognition | AISHELL-1 | Error Rate0.71 | 10 | |
| Automatic Speech Recognition | SLR93 (AISHELL-3) (test) | CER4.55 | 10 | |
| Automatic Speech Recognition | AISHELL Mandarin 3 | CER1.86 | 9 | |
| Automatic Speech Recognition | AISHELL-2 | Word Error Rate (WER)2.16 | 7 | |
| Automatic Speech Recognition | AISHELL-1 1.0 (test) | CER (Offline, Rescoring)5.25 | 7 | |
| ASR Error Correction | AISHELL-1 (dev) | WER3.8 | 6 | |
| Target Speaker Extraction | AISHELL Noisy zero-shot | SI-SDR10.2 | 5 | |
| Target Speaker Extraction | AISHELL zero-shot Clean | SI-SDR13.4 | 5 | |
| Timestamp ASR | AISHELL-1 zh | AAS Score833.66 | 4 | |
| Automatic Speech Recognition | AISHELL-3 (test) | CER23.47 | 4 | |
| Speaker-aware ASR | AISHELL-1 & AISHELL-2 augmented with VoxCeleb & MUSAN 20 dB SNR | WER1.24 | 4 | |
| Speaker-aware ASR | AISHELL-1 & AISHELL-2 augmented with VoxCeleb & MUSAN 10 dB SNR | WER1.43 | 4 | |
| Speaker-aware ASR | AISHELL-1 & AISHELL-2 augmented with VoxCeleb & MUSAN (2 dB SNR) | WER5.34 | 4 |