| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Text-to-Image Generation | Qwen-Image | Latency (s)1.401 | 25 | |
| Text-to-Image Generation | Qwen-Image Evaluation Set | Latency (s)22.4 | 12 | |
| Resolution extrapolation | Qwen-Image Direct extrapolation (test) | FID78.15 | 6 | |
| Generative Diversity Evaluation | Qwen-Image | DINOv3 Score0.958 | 3 | |
| AI-generated image detection | Qwen-Image | Accuracy94.1 | 3 | |
| Multimodal Inference | Qwen-Image (inference) | Inference Latency (s)14.92 | 2 |