| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Single-object 4D Motion Generation | User Study Single-object 4D Motion Generation 1.0 (test) | Prompt Alignment47 | 36 | |
| Image Editing | User Study 100 images (test) | User Selection Rate94.3 | 32 | |
| Image Style Transfer | User Study | Overall Quality Score83.9 | 30 | |
| Talking head synthesis | User Study | Lip Sync Quality4.46 | 18 | |
| Qualitative Interface Comparison | User Study (N=24) (between-subjects) | Mentions10 | 17 | |
| Image Personalization | User Study Personalization Tasks | Concept Preservation (CP)95.3 | 17 | |
| Task-Oriented Robot-Human Handover | User Study Franka Panda | Failure Rate37 | 16 | |
| Text-to-Image Generation | User Study 12 Prompts (test) | Win Rate (Full Description)82.84 | 13 | |
| Single-character story generation | User Study | C-A Score4.62 | 13 | |
| Image Composition | User Study | Average Ranking1.52 | 13 | |
| Talking Face Emotion Editing | User Study Extended Emotion | Emotional Accuracy91 | 12 | |
| Talking Face Emotion Editing | User Study Basic Emotion | Emotional Expression84.5 | 12 | |
| Text-to-Image Generation | User Study Human Evaluation | VisualPrompter Preference60 | 12 | |
| Image Inpainting | User Study 40 random images (test) | UOM1.6 | 12 | |
| Text Alignment | User Study | Average Ranking1.54 | 12 | |
| Talking Head Generation | User Study | Lip Sync156 | 11 | |
| User Satisfaction Evaluation | User Study Industry | Average Score55.01 | 10 | |
| User Satisfaction Evaluation | User Study Navigation | Average Score85.07 | 10 | |
| User Satisfaction Evaluation | User Study Shopping | Average Satisfaction Score79.36 | 10 | |
| Facial Reconstruction | User Study | ID Consistency4.85 | 10 | |
| 3D Motion Generation | User Study | Motion Realism Preference80 | 10 | |
| Subjective Image Quality Assessment | User Study (test) | Average Rank1.17 | 10 | |
| Style Transfer | User Study 10 content images, 8 style images (test) | Style Score54.6 | 9 | |
| Visual Dubbing | User Study | Realism4.4 | 9 | |
| Character Animation | User Study 20 identities and 20 driving videos (test) | Video Quality0.9 | 9 |