| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Lip-reading | LRW-1000 (test) | Accuracy55.7 | 50 | |
| Lip-reading Classification | LRW (test) | Accuracy88.5 | 38 | |
| Lip Reading | LRW 1.0 (test) | Top-1 Accuracy94.8 | 37 | |
| Talking Face Generation | LRW (test) | SSIM1 | 28 | |
| Lip Reading | LRW original (test) | Top-1 Accuracy88.5 | 14 | |
| Lip Reading | LRW Word-level (test) | Accuracy88.8 | 13 | |
| Word Recognition | LRW (test) | Correct Rate98 | 13 | |
| Lip Reading Classification | LRW-1000 cropped mouth regions (test) | Top-1 Accuracy0.466 | 9 | |
| Lip Reading | LRW (test) | Word Accuracy85.4 | 8 | |
| Talking Head Generation | LRW 38 | LSE-C1.762 | 6 | |
| Lip Reading | LRW | Accuracy84.8 | 6 | |
| Visual Speech Recognition | LRW | Top-1 Accuracy80.3 | 5 | |
| Lip-syncing | LRW 8 (test) | LSE-D6.512 | 5 | |
| Word-level lip reading | LRW-1000 | Accuracy33.1 | 4 | |
| Visual Keyword Spotting | LRW 16 (test) | Top-1 Class Accuracy85.8 | 2 | |
| Lip-to-Speech Synthesis | LRW (test) | WER0.342 | 2 |