Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

LRS3

Benchmarks

Task NameDataset NameSOTA ResultTrend
Visual Speech RecognitionLRS3 (test)
WER0.77
159
Visual Speech RecognitionLRS3 High-Resource, 433h labelled v1 (test)
WER0.009
80
Audio-Visual Speech RecognitionLRS3 clean (test)
WER0.72
70
Visual Speech RecognitionLRS3
WER0.009
59
Automatic Speech RecognitionLRS3 (test)
WER (%)0.79
46
Visual Speech RecognitionLRS3 Low-Resource 30h labelled v1 (test)
WER0.024
34
English transcriptionLRS3 Noisy 0-SNR (test)
WER0.046
25
Speech RecognitionLRS3-TED
WER7.2
25
Automatic Speech RecognitionLRS3 Clean original (test)
WER0.68
21
Visual Speech RecognitionLRS3 low-resource (test)
WER19.3
20
Audio-Visual Speech SeparationLRS3 (test)
SDRi18.5
20
Automatic Speech RecognitionLRS3 433-hour labeled (test)
WER (%)1.3
19
Lip ReadingLRS3 1.0 (test)
WER25.51
19
Speech RecognitionLRS3 high-resource
WER (V)17.6
18
Speech RecognitionLRS3 low-resource
WER (V)23.7
18
Audio-Visual Speech RecognitionLRS3 (test)
WER0.9
18
Automatic Speech RecognitionLRS3 low-resource (test)
WER0.016
18
Automatic Lip-ReadingLRS3 v1 (dev)
WER16.92
18
Speech EnhancementLRS3 mixed with QUT city-street noises (test)
PESQ3.21
18
Speech EnhancementLRS3 mixed with VGGSound noises (test)
PESQ3.25
18
Automatic Speech RecognitionLRS3 High-Resource 433h labelled v1 (test)
WER0.012
16
Visual Speech RecognitionLRS3 high-resource (test)
WER23.1
16
Automatic Speech RecognitionLRS3 Low-Resource 30h labelled v1 (test)
WER (%)2.3
15
Audio Speech RecognitionLRS3
WER0.7
14
Speech SeparationLRS3-2Mix (test)
SDRi17.5
11
Showing 25 of 67 rows