| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Automatic Speech Recognition | WenetSpeech Meeting (test) | CER4.67 | 78 | |
| Automatic Speech Recognition | WenetSpeech Net (test) | CER4.6 | 57 | |
| Audio-Text | Wenetspeech | WER6.6 | 34 | |
| Automatic Speech Recognition | WenetSpeech (meeting) | WER4.63 | 23 | |
| Automatic Speech Recognition | WenetSpeech net | WER4.68 | 20 | |
| Automatic Speech Recognition | WenetSpeech (dev) | CER7.19 | 14 | |
| Automatic Speech Recognition | WenetSpeech Chuan hard | CER20.2 | 13 | |
| Automatic Speech Recognition | WenetSpeech Chuan easy | CER10.94 | 13 | |
| Audio Question Answering | WenetSpeech-QA | Audio-QA Score48.47 | 12 | |
| Automatic Speech Recognition | WenetSpeech Yue long | CER5.82 | 12 | |
| Automatic Speech Recognition | WenetSpeech (test-net) | CER5.29 | 10 | |
| Automatic Speech Recognition | WenetSpeech net | Character Error Rate (CER)4.97 | 6 | |
| Automatic Speech Recognition | WenetSpeech Meeting domain (ws-meeting) | CER4.32 | 5 | |
| Automatic Speech Recognition | WenetSpeech Internet domain (ws-net) | CER4.44 | 5 | |
| Audio Understanding | Wenetspeech (testnet) | Error Rate4.69 | 4 | |
| Automatic Speech Recognition | WenetSpeech-Yue Long sentence | WER12.1 | 4 | |
| Automatic Speech Recognition | WenetSpeech (test_net) | WER4.66 | 4 | |
| Automatic Speech Recognition | WenetSpeech n (Net) | WER (%)6.46 | 3 | |
| Text-to-Speech | WenetSpeech Yue-TTS | WER12.2 | 3 |