Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Audio-visual Speech Recognition on LRS3 Occlusion by hands

26.6WER (Babble, -10 dB)

CAV2vec

25.37633.63841.950.162Dec 16, 2025
Updated 3d ago

Evaluation Results

MethodLinks
2025.12
26.612.44.52.61.89.66.23.62.62.21.73.39.44.82.61.91.74.15.27.41.5
2025.12
31.714.55.12.7211.27.74.22.62.21.93.711.25.33.12.11.74.76.18.71.6
2025.12
3215.55.32.7211.58.14.232.323.911.55.32.92.31.84.86.291.6
2025.12
33.215.65.72.6211.88.14.72.82.31.9412.362.92.21.956.59.41.5
2025.12
38.321.65.72.521441.82913.94.62.418.418.57.53.32.21.96.711.417.41.7
2025.12
4625.894.22.817.660.64418.95.52.926.423.711.45.13.42.79.315.623.72.2
2025.12
57.231.913.77.55.323.145.729.317.1106.821.832.3189.26.25.414.218.326.24.1