Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Audio-Visual Speech Recognition on LRS3 + DEMAND Object Occlusion + Noise (test)
Loading...
2.8
Error Rate (PARK)
CAV2vec
2.728
3.214
3.7
4.186
Dec 16, 2025
Error Rate (PARK)
Error Rate (RIVER)
Error Rate (CAFE)
Error Rate (RESTO)
Error Rate (CAFETERIA)
Error Rate (METRO)
Error Rate (STATION)
Error Rate (MEETING)
Average Error Rate
Updated 3d ago
Evaluation Results
Method
Method
Links
Error Rate (PARK)
Error Rate (RIVER)
Error Rate (CAFE)
Error Rate (RESTO)
Error Rate (CAFETERIA)
Error Rate (METRO)
Error Rate (STATION)
Error Rate (MEETING)
Average Error Rate
CAV2vec
Visual Corruption Type...
2025.12
2.8
4.3
4.4
8.4
5.1
2.3
3.8
3.5
4.3
AV-HuBERT
Visual Corruption Type...
2025.12
3.4
4.6
5.1
10.2
5.9
2.7
4.1
3.9
5
AV-data2vec
Visual Corruption Type...
2025.12
3.4
4.5
5.1
10.3
6.2
2.7
4.1
4.4
5.1
AV-RelScore
Visual Corruption Type...
2025.12
3.4
4.5
5.1
9.3
5.4
2.8
3.9
3.8
4.8
BRAVEn
Visual Corruption Type...
2025.12
4.6
7.3
6.6
14.9
8.1
3.3
6.1
13.5
8.1
Feedback
Search any
task
Search any
task