| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Visual Speech Recognition | CMLR (Seen) | CER20.38 | 12 | |
| Lip Reading | CMLR (test) | CER21.49 | 11 | |
| Inter-frame Flickering Evaluation | CMLR (test) | Error0.4 | 10 | |
| Talking-head Generation | CMLR (test) | FID25.19 | 9 | |
| Visual Speech Recognition | CMLR (Unseen) | CER38.23 | 8 | |
| Visual Speech Recognition | CMLR (test) | Inference Latency (ms)52.3 | 8 | |
| Visual Speech Recognition | CMLR | Best CER8 | 7 | |
| Lip Reading | CMLR 1.0 (test) | CER3.9 | 7 |