Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Audio-Visual Speech Recognition on LRS3 noisy synthesized using MUSAN noise (test)
Loading...
5.6
WER
MIR-GAN
5.232
7.716
10.2
12.684
Jun 18, 2023
WER
Updated 4d ago
Evaluation Results
Method
Method
Links
WER
MIR-GAN
Backbone=Transformer,...
2023.06
5.6
AV-HuBERT
Backbone=Transformer,...
2023.06
5.8
Base model
Backbone=Transformer,...
2023.06
5.8
MIR-GAN
Backbone=Conformer, Cr...
2023.06
8.5
Base model
Backbone=Conformer, Cr...
2023.06
10.9
MIR-GAN
Backbone=Transformer,...
2023.06
11.7
Base model
Backbone=Transformer,...
2023.06
14.8
Feedback
Search any
task
Search any
task