Our new X account is live! Follow @wizwand_team for updates
Search any
task
Feedback
Search any
task
SOTA Visual Speech Recognition benchmarks and papers with code | Wizwand
Our new X account is live! Follow @wizwand_team for updates
Home
/
Tasks
Visual Speech Recognition
Benchmarks
Dataset Name
SOTA Method
Dataset Name
SOTA Method
Metric
Trend
Results
Last Updated
LRS3 (test)
Llama-AVSR
WER
0.77
159
4d ago
LRS3 High-Resource, 433h labelled v1 (test)
Chang et al.
WER
0.009
80
4d ago
LRS3
Auto-AVSR
WER
0.009
59
4d ago
LRS2
AutoAVSR
Mean WER
14.6
45
4d ago
LRS3 Low-Resource 30h labelled v1 (test)
USR
WER
0.024
34
4d ago
LRS3 low-resource (test)
Serdyuk et al. (2022)
WER
19.3
20
4d ago
LRS3 high-resource (test)
RAVEn w/ self-training
WER
23.1
16
4d ago
WildVSR
Auto-AVSR
WER
38.6
15
4d ago
LRS2 v0.4 (test)
Ours (raw A + V)
WER
3.7
14
4d ago
LSVSR (test)
Audio-Ph
Word Error Rate
18.3
10
4d ago
LRS3 v0.4 (test)
Ours (raw A + V)
WER
2.3
9
4d ago
CMLR
VSR model with prediction-based auxiliary tasks
Best CER
8
7
4d ago
CNVSRC-Multi Mandarin (dev)
VALLR-Pin
CER
24.1
6
4d ago
LRW
SyncVSR
Top-1 Accuracy
80.3
5
4d ago
LRS3 v0.0 (test)
Ours (raw A + V)
WER
1.2
5
4d ago
Self-Collected Dataset Mandarin (test)
VALLR-Pin
CER
32.22
4
4d ago
CMU-MOSEAS-Spanish (CMes)
CM-seq2seq
Best Score
58.1
4
4d ago
CMU-MOSEAS-Portuguese (CMpt) (test)
VSR model with prediction-based auxiliary tasks
Mean WER
51.6
4
4d ago
Multilingual TEDx-Spanish (MTes) (test)
VSR model with prediction-based auxiliary tasks
Mean WER
56.6
4
4d ago
CMU-MOSEAS French
VSR model with prediction-based auxiliary tasks
Mean WER
59.1
4
4d ago
Multilingual TEDx-Portuguese (MTpt) (test)
CM-seq2seq
Mean Accuracy
70.2
4
4d ago
Multilingual TEDx Italian (MTit) (test)
VSR model with prediction-based auxiliary tasks
Mean WER
57.9
4
4d ago
LRS3-TED Full (test)
V2P
WER
55.1
2
4d ago
LRS3-TED Filtered (test)
V2P
WER
47
1
4d ago
Showing 24 of 24 rows
25 / page
50 / page
100 / page
1
Search any
task
Search any
task
Terms of Service
FAQs