Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Audio-Visual Speech Recognition on LRS3 + DEMAND Object Occlusion + Noise (test)
Loading...
2.8
Error Rate (PARK)
CAV2vec
2.728
3.214
3.7
4.186
Dec 16, 2025
Error Rate (PARK)
Error Rate (RIVER)
Error Rate (CAFE)
Error Rate (RESTO)
Error Rate (CAFETERIA)
Error Rate (METRO)
Error Rate (STATION)
Error Rate (MEETING)
Average Error Rate
Updated 1mo ago
Evaluation Results
Method
Method
Links
Error Rate (PARK)
Error Rate (RIVER)
Error Rate (CAFE)
Error Rate (RESTO)
Error Rate (CAFETERIA)
Error Rate (METRO)
Error Rate (STATION)
Error Rate (MEETING)
Average Error Rate
CAV2vec
Visual Corruption Type...
2025.12
2.8
4.3
4.4
8.4
5.1
2.3
3.8
3.5
4.3
AV-HuBERT
Visual Corruption Type...
2025.12
3.4
4.6
5.1
10.2
5.9
2.7
4.1
3.9
5
AV-data2vec
Visual Corruption Type...
2025.12
3.4
4.5
5.1
10.3
6.2
2.7
4.1
4.4
5.1
AV-RelScore
Visual Corruption Type...
2025.12
3.4
4.5
5.1
9.3
5.4
2.8
3.9
3.8
4.8
BRAVEn
Visual Corruption Type...
2025.12
4.6
7.3
6.6
14.9
8.1
3.3
6.1
13.5
8.1
Feedback
Search any
task
Search any
task