| Task Name | Dataset Name | SOTA Result | Trend |
|---|---|---|---|
| Text-to-image generation | GenEval | Overall Score: 96 | 506 |
| Text-to-Image Generation | GenEval | Overall Score: 95 | 391 |
| Text-to-Image Generation | GenEval | GenEval Score: 95 | 360 |
| Text-to-Image Generation | GenEval (test) | Two Obj. Acc: 99 | 221 |
| Text-to-Image Generation | GenEval | Overall Score: 94 | 218 |
| Text-to-Image Generation | GenEval | Overall Score: 88.3 | 96 |
| Text-to-Image Generation | GenEval | GenEval Score: 0.9 | 88 |
| Text-to-Image Generation | GenEval 1.0 (test) | Overall Score: 84 | 85 |
| Image Generation | GenEval | Overall Score: 89 | 57 |
| Text-to-Image Generation | GenEval++ | Color Accuracy: 90 | 45 |
| Compositional Image Generation | GenEval | Overall Score: 0.99 | 44 |
| Image Generation | GenEval (test) | GenEval Score: 91 | 35 |
| Text-to-Image Generation | GenEval (val) | GenEval Score: 90 | 33 |
| Visual Generation | GenEval | Single Obj. Acc: 99 | 31 |
| Text-to-image reward alignment | GenEval (test) | Reward 1 Score (r1): 0.26 | 30 |
| Image Generation | GenEval overall | GenEval Overall Score: 90 | 30 |
| Text-to-Image Generation | GenEval | Two Objects Score: 96.97 | 27 |
| Text-to-Image Generation | GenEval 1024x1024 | Overall Score (GenEval): 0.8 | 23 |
| Multimodal Generation | GenEval | Score: 90 | 21 |
| Composition Image Generation | GenEval | GenEval Score: 97 | 20 |
| Text-to-Image | GenEval 11 (test) | Accuracy (Single Obj): 100 | 19 |
| Text-to-Image Generation | GenEval | GE Score: 61.9 | 18 |
| Text-to-Image Generation | GenEval | DINO: 0.786 | 18 |
| Text-to-Image Generation | GenEval 2 | Soft TIFA AM: 80 | 17 |
| Text-to-Image Generation | GenEval | GenEval Score: 77 | 17 |