SOTA Audio-Visual Speech Recognition benchmarks and papers with code | Wizwand
Audio-Visual Speech Recognition
Benchmarks
| Dataset Name | SOTA Method | Metric | Value | Results | Last Updated |
|---|---|---|---|---|---|
| LRS3 (test) | AVUR-LLM | WER | 0.68 | 77 | 16d ago |
| LRS3 clean (test) | MMS-LLAMA | WER | 0.72 | 77 | 1mo ago |
| LRS2 (test) | USR 2.0 | WER | 1.3 | 34 | 1mo ago |
| LRS-3 Babble noise at 0dB SNR (test) | LP Conformer | WER | 1.9 | 32 | 1mo ago |
| LRS3 30h labeled low-resource (test) | DistillAV-L | WER | 1.8 | 22 | 1mo ago |
| LRS2 (clean) | MIR-GAN | WER | 2.2 | 16 | 16d ago |
| LRS3 | Llama-AVSR | WER | 0.008 | 14 | 25d ago |
| WildVSR (test) | USR 2.0 | WER | 0.385 | 12 | 1mo ago |
| LRS3 (test) | GER w/ Auto-AVSR | Overall Score | 43 | 10 | 1mo ago |
| LRS2 50% visual occlusion (test) | ASR + VSR oracle onb / ocp | WER (Overall) | 6.4 | 10 | 1mo ago |
| TED LRS3 | VGG CONFORMER | WER | 0.009 | 10 | 1mo ago |
| MuAViC Noise environment (test) | XLAVS-R 2B | Accuracy (En) | 49.5 | 9 | 1mo ago |
| MuAViC Clean environment (test) | XLS-R 300M | En Acc | 2.5 | 9 | 1mo ago |
| LRS3 noisy | AV-HuBERT + CMA + MoHAVE | Average Error Rate | 4.2 | 8 | 1mo ago |
| LRS3 433 h 0 dB SNR | AVUR-LLM | WER | 1.7 | 7 | 1mo ago |
| LRS3 433 h 5 dB SNR | MMS-LLaMA | WER | 1.3 | 7 | 1mo ago |
| LRS3 Pixelated face | CAV2vec | WER (Babble, -10 dB) | 26 | 7 | 1mo ago |
| LRS3 Occlusion by hands | CAV2vec | WER (Babble, -10 dB) | 26.6 | 7 | 1mo ago |
| LRS3 Object occlusion and noise | CAV2vec | WER (Babble, -10 dB) | 25.8 | 7 | 1mo ago |
| LRS3 noisy synthesized using MUSAN noise (test) | MIR-GAN | WER | 5.6 | 7 | 1mo ago |
| MuAViC (test) | AV-HUBERT | Accuracy (Ara) | 89.4 | 7 | 1mo ago |
| LRS2 noisy (MUSAN) | MIR-GAN | WER | 7 | 6 | 1mo ago |
| LRS3 + DEMAND Object Occlusion + Noise (test) | CAV2vec | Error Rate (PARK) | 2.8 | 5 | 1mo ago |
| FLEURS Noise environment (test) | XLAVS-R 2B | WER | 74 | 5 | 1mo ago |
| FLEURS Clean environment (test) | XLAVS-R 300M | WER | 32.5 | 5 | 1mo ago |
Showing 25 of 49 rows
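
Most of the benchmarks listed above are scored by WER (word error rate). This page does not show how each entry computes it, so the following is only a minimal sketch of the conventional definition: the word-level edit distance (substitutions, deletions, insertions) divided by the number of reference words. The function name `word_error_rate` is hypothetical, and the whitespace tokenization is an assumption; real evaluation scripts usually also normalize case and punctuation.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Return WER as a fraction: word-level edit distance / reference length.

    Illustrative sketch only; tokenization by whitespace is an assumption.
    """
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference prefix
    # processed so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1       # reference word missing from hypothesis
            insertion = curr[j - 1] + 1  # extra word in hypothesis
            curr[j] = min(substitution, deletion, insertion)
        prev = curr

    return prev[-1] / len(ref)


if __name__ == "__main__":
    wer = word_error_rate("a b c d", "a x c d")  # one substitution in four words
    print(f"WER as fraction: {wer:.3f}, as percentage: {100 * wer:.1f}%")
```

The function returns a fraction, while leaderboards often quote WER as a percentage. The table does not state which scale each row uses, so values should be compared across rows with care.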