| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Text-to-Image Generation | DPG-Bench | Overall Score88.63 | 451 | |
| Text-to-Image Generation | DPG-Bench | DPG Score88.9 | 156 | |
| Text-to-image generation | DPG-Bench | Average Score88.79 | 77 | |
| Text-to-Image Generation | DPG-Bench (test) | Overall Fidelity88.32 | 68 | |
| Text-to-Image Generation | DPG-Bench | DPG Score89 | 36 | |
| Text-to-Image Alignment | DPG-Bench | Global Alignment Score91.7 | 20 | |
| Dense prompt following | DPG-Bench v1.0 (test) | Entity Score93.08 | 20 | |
| Text-to-Image Generation | DPG-Bench | Overall Score88.32 | 19 | |
| Image Generation | DPG-Bench 130 | Score88.3 | 15 | |
| Text-to-Image Generation | DPG-Bench | DPG Percentage Score88.79 | 11 | |
| Dense Prompt Alignment | DPG-Bench | Overall Score85.08 | 11 | |
| Text-to-Image | DPG-Bench | DPG-Bench Score86.14 | 10 | |
| Text-to-Image Generation | DPG-Bench 17 | Global Score90.97 | 8 | |
| Human preference, image quality, and aesthetics comparison | DPG-Bench | DeQA Score4.12 | 8 | |
| Text-to-3D Generation | DPG-Bench 1.0 (test) | Global Score81.82 | 7 | |
| Dense prompt-following | DPG-Bench | Score85.43 | 6 | |
| Text-to-image alignment | DPG-Bench (test) | DPG77.13 | 6 | |
| Text-to-Image Generation | DPG-Bench zero-shot | DPG-Bench Score (Zero-Shot)70.1 | 5 | |
| Multimodal Generation | DPG-Bench | Global Score82.37 | 3 |