| Dataset Name | SOTA Method | Metric | Trend | ||
|---|---|---|---|---|---|
| User Study | PC-AVS | Lip Sync Quality4.46 | 18 | 4d ago | |
| Conver-3D YouTube (test) | ViBES | FDD2.76 | 9 | 4d ago | |
| Self-Collected Dataset 50 identities | X-Portrait | FID32.77 | 6 | 4d ago | |
| VFHQ (first 100 frames) | X-Portrait | FID26.22 | 6 | 4d ago | |
| HDTF (test) | EchoMimic | IQA3.994 | 6 | 4d ago | |
| CelebV-HQ (test) | Ours-Balanced | IQA3.588 | 6 | 4d ago | |
| HDTF Set B (test) | SyncNet (Offset)-3 | 6 | 4d ago | ||
| HDTF Set A (test) | SyncNet Offset-4 | 6 | 4d ago | ||
| Talking Head Synthesis Datasets (test) | Narrating For You | PSNR35.94 | 5 | 4d ago | |
| Talkinghead1kh | Face-V2V | PSNR24.17 | 5 | 4d ago | |
| HDTF | Face-V2V | PSNR27 | 5 | 4d ago |