| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Speech Recognition | AISHELL-1 (dev) | WER1.2 | 28 | |
| Speaker Diarization | AISHELL-4 | DER (%)1.6 | 20 | |
| Automatic Speech Recognition | AISHELL (test) | CER1.95 | 20 | |
| Automatic Speech Recognition | AISHELL-2 (ios) | CER2.33 | 10 | |
| Automatic Speech Recognition | AISHELL-1 | WER1.96 | 10 | |
| Automatic Speech Recognition | AISHELL-1 1.0 (test) | CER (Offline, Rescoring)5.25 | 7 | |
| ASR Error Correction | AISHELL-1 (dev) | WER3.8 | 6 | |
| Target Speaker Extraction | AISHELL Noisy zero-shot | SI-SDR10.2 | 5 | |
| Target Speaker Extraction | AISHELL zero-shot Clean | SI-SDR13.4 | 5 | |
| Speaker-Attributed Automatic Speech Recognition | AISHELL-4 (test) | CER0.1543 | 4 | |
| Speech Watermarking | AiShell3 (OOD) | GN+Ec99.33 | 4 | |
| Speech Watermarking | AiShell3 (out-of-distribution) | Robustness (Gaussian Noise 5 dB)98.68 | 4 | |
| Automatic Speech Recognition | AISHELL | CER0.54 | 4 | |
| Automatic Speech Recognition | AISHELL | WER0.62 | 3 | |
| Timestamp Prediction | AISHELL (test) | AAS (ms)71 | 2 | |
| Automatic Speech Recognition | AISHELL (dev) | CER6 | 2 |