Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Audio-visual Speech Recognition on LRS3 Pixelated face

26WER (Babble, -10 dB)

CAV2vec

24.82432.76240.748.638Dec 16, 2025
Updated 3d ago

Evaluation Results

MethodLinks
2025.12
26124.72.51.99.45.83.62.52.31.73.29.64.22.61.91.745.17.31.5
2025.12
29.813.652.61.910.67.44.12.72.123.710.95.12.82.21.94.65.88.31.6
2025.12
30.113.15.22.6210.67.34.332.21.93.710.353.12.11.84.55.88.31.5
2025.12
30.614.35.22.6210.97.54.532.323.910.75.42.92.11.74.668.61.5
2025.12
39.121.65.72.71.914.242.43113.34.82.218.718.57.63.42.21.96.711.617.71.7
2025.12
43.725.28.53.62.816.86042.818.253.225.823.410.95.23.22.69.115.223.12.3
2025.12
55.431.813.77.55.322.744.428.316.810.1721.331.817.99.46.45.214.118.125.74.2