Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

AISHELL

Benchmarks

Task NameDataset NameSOTA ResultTrend
Speaker-Attributed Automatic Speech RecognitionAISHELL-4 (test)
cpCER0.2004
33
Automatic Speech RecognitionAISHELL-1
WER0.6
31
Speech RecognitionAISHELL-1 (dev)
WER1.2
28
Automatic Speech RecognitionAISHELL (test)
CER1.13
26
Speaker DiarizationAISHELL-4
DER (%)1.6
20
Automatic Speech RecognitionAISHELL-2 (ios)
CER2.29
16
Automatic Speech RecognitionAISHELL D 2021 (test)
CER1.66
15
Automatic Speech RecognitionAISHELL C 2021 (Eval)
CER1.51
15
Automatic Speech RecognitionAISHELL Eval A 2021
CER3.12
15
Automatic Speech RecognitionAISHELL-2 ios (dev)
CER2.07
15
Speech SynthesisAISHELL3 Mandarin
UTMOS2.7
14
Automatic Speech RecognitionAISHELL-1
Word Error Rate (WER)0.76
10
Automatic Speech RecognitionAISHELL-1
Error Rate0.71
10
Automatic Speech RecognitionSLR93 (AISHELL-3) (test)
CER4.55
10
Automatic Speech RecognitionAISHELL Mandarin 3
CER1.86
9
Automatic Speech RecognitionAISHELL-2
Word Error Rate (WER)2.16
7
Automatic Speech RecognitionAISHELL-1 1.0 (test)
CER (Offline, Rescoring)5.25
7
ASR Error CorrectionAISHELL-1 (dev)
WER3.8
6
Target Speaker ExtractionAISHELL Noisy zero-shot
SI-SDR10.2
5
Target Speaker ExtractionAISHELL zero-shot Clean
SI-SDR13.4
5
Timestamp ASRAISHELL-1 zh
AAS Score833.66
4
Automatic Speech RecognitionAISHELL-3 (test)
CER23.47
4
Speaker-aware ASRAISHELL-1 & AISHELL-2 augmented with VoxCeleb & MUSAN 20 dB SNR
WER1.24
4
Speaker-aware ASRAISHELL-1 & AISHELL-2 augmented with VoxCeleb & MUSAN 10 dB SNR
WER1.43
4
Speaker-aware ASRAISHELL-1 & AISHELL-2 augmented with VoxCeleb & MUSAN (2 dB SNR)
WER5.34
4
Showing 25 of 37 rows