| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Knowledge-grounded reasoning | WISE | Overall Score89 | 45 | |
| World Knowledge Image Generation | WISE | Overall Score89 | 39 | |
| Reasoning-based text-to-image generation | WISE | Overall Score80 | 33 | |
| Text-to-Image Generation | WISE (test) | Overall Score80 | 32 | |
| Multimodal Understanding and Generation | WISE | Overall Accuracy80 | 29 | |
| Reasoning Generation | WISE 1.0 (test) | Overall Score80 | 17 | |
| World Knowledge Reasoning | WISE random subset of 200 samples | Cultural Accuracy94 | 17 | |
| Text-to-Image Generation | WISE | WISE Score0.7 | 13 | |
| Text-to-image generation | WISE 48 (test) | Cultural Score0.81 | 12 | |
| Text-to-Image Generation | WISE | Physics Score67 | 9 |