SOTA Visual Speech Recognition benchmarks and papers with code | Wizwand
Visual Speech Recognition
Benchmarks
| Dataset Name | SOTA Method | Metric | Value | Results | Last Updated |
| --- | --- | --- | --- | --- | --- |
| LRS3 (test) | Llama-AVSR | WER | 0.77 | 240 | 1d ago |
| LRS3 High-Resource, 433h labelled v1 (test) | Chang et al. | WER | 0.009 | 80 | 3mo ago |
| LRS3 | Auto-AVSR | WER | 0.009 | 63 | 2mo ago |
| LRS2 | AutoAVSR | Mean WER | 14.6 | 49 | 2mo ago |
| LRS3 Low-Resource 30h labelled v1 (test) | USR | WER | 0.024 | 34 | 3mo ago |
| LRS3 30h labeled low-resource (test) | UASR-LLM-L | WER | 25.3 | 28 | 3mo ago |
| DVSpeaker (Cross-scene) | NeuroLip | SI (45°) | 71.68 | 21 | 1mo ago |
| DVSpeaker Matched-scene | Get | SI Accuracy (0°) | 100 | 21 | 1mo ago |
| LRS3 low-resource (test) | Serdyuk et al. (2022) | WER | 19.3 | 20 | 3mo ago |
| LRS3 high-resource (test) | RAVEn w/ self-training | WER | 23.1 | 16 | 3mo ago |
| WildVSR | Auto-AVSR | WER | 38.6 | 15 | 3mo ago |
| LRS2 v0.4 (test) | Ours (raw A + V) | WER | 3.7 | 14 | 3mo ago |
| CMLR (Seen) | Cascade-Free Mandarin VSR | CER | 20.38 | 12 | 2mo ago |
| LSVSR (test) | Audio-Ph | Word Error Rate | 18.3 | 10 | 3mo ago |
| LRS3 v0.4 (test) | Ours (raw A + V) | WER | 2.3 | 9 | 3mo ago |
| CMLR (Unseen) | Cascade-Free Mandarin VSR | CER | 38.23 | 8 | 2mo ago |
| CMLR (test) | Lipnet | Inference Latency (ms) | 52.3 | 8 | 2mo ago |
| CMLR | VSR model with prediction-based auxiliary tasks | Best CER | 8 | 7 | 3mo ago |
| CNVSRC-Multi Mandarin (dev) | VALLR-Pin | CER | 24.1 | 6 | 3mo ago |
| LRW | SyncVSR | Top-1 Accuracy | 80.3 | 5 | 3mo ago |
| LRS3 v0.0 (test) | Ours (raw A + V) | WER | 1.2 | 5 | 3mo ago |
| Self-Collected Dataset Mandarin (test) | VALLR-Pin | CER | 32.22 | 4 | 3mo ago |
| CMU-MOSEAS-Spanish (CMes) | CM-seq2seq | Best Score | 58.1 | 4 | 3mo ago |
| CMU-MOSEAS-Portuguese (CMpt) (test) | VSR model with prediction-based auxiliary tasks | Mean WER | 51.6 | 4 | 3mo ago |
| Multilingual TEDx-Spanish (MTes) (test) | VSR model with prediction-based auxiliary tasks | Mean WER | 56.6 | 4 | 3mo ago |
Showing 25 of 30 rows