| Task Name | Dataset Name | SOTA Result | Trend |
|---|---|---|---|
| Visual Speech Recognition | CMLR (Seen) | CER 20.38 | 12 |
| Lip Reading | CMLR (test) | CER 21.49 | 11 |
| Visual Speech Recognition | CMLR (Unseen) | CER 38.23 | 8 |
| Visual Speech Recognition | CMLR (test) | Inference Latency (ms) 52.3 | 8 |
| Visual Speech Recognition | CMLR | Best CER 8 | 7 |
| Lip Reading | CMLR 1.0 (test) | CER 3.9 | 7 |