| Task Name | Dataset Name | SOTA Result | Trend |
|---|---|---|---|
| World Knowledge Image Generation | WISE | Overall Score: 89 | 110 |
| Reasoning-based text-to-image generation | WISE | Overall Score: 87 | 70 |
| Knowledge-grounded reasoning | WISE | Overall Score: 89 | 68 |
| Text-to-Image Generation | WISE | WISE Score: 0.7 | 67 |
| Multimodal Understanding and Generation | WISE | Overall Accuracy: 80 | 65 |
| Text-to-image generation | WISE | Culture Score: 0.93 | 51 |
| Text-to-Image Generation | WISE | Cultural Score: 81 | 48 |
| Text-to-Image Generation | WISE (test) | Overall Score: 80 | 45 |
| Reasoning Generation | WISE 1.0 (test) | Overall Score: 80 | 17 |
| World Knowledge Reasoning | WISE random subset of 200 samples | Cultural Accuracy: 94 | 17 |
| Image Generation | WISE | Score: 75 | 12 |
| Text-to-image generation | WISE 48 (test) | Cultural Score: 0.81 | 12 |
| Knowledge-informed text-to-image generation | WISE 96 | Cultural Score: 63 | 10 |
| Text-to-Image Generation | WISE | Physics Score: 67 | 9 |
| World Knowledge-aware Image Generation | WISE | Consistency: 1.26 | 2 |