| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Automatic Speech Recognition | LRS2-BBC (test) | WER0.056 | 21 | |
| Automatic Speech Recognition | LRS2-BBC Noisy (test) | WER23.6 | 9 | |
| Lip-to-speech synthesis | LRS2-BBC (test) | UTMOS4.1664 | 7 | |
| Video-to-speech synthesis | LRS2-BBC (test) | UTMOS3.921 | 7 |