Audio-Visual Speech Recognition on LRS2 (clean)

2.2 WER

MIR-GAN

Updated 3mo ago

Evaluation Results

Method	Links	WER
MIR-GAN 2023.06		2.2
Base model 2023.06		2.3
MoCo+wav2vec 2023.06		2.6
MIR-GAN 2023.06		3.2
Hyb-Conformer 2023.06		3.7
Base model 2023.06		3.9
MIR-GAN 2023.06		4.5
Base model 2023.06		5.4
LF-MMI TDNN 2023.06		5.9
Hyb-RNN 2023.06		7
TM-CTC 2023.06		8.2
TM-seq2seq 2023.06		8.5
VisG AV-HuBERT Large 2026.04		9.925
AV-HuBERT Base 2026.04		10.09
AV-HuBERT Large 2026.04		10.3
VisG AV-HuBERT Base 2026.04		10.58