| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Text-to-Image Generation | Pick-a-Pic | ImageReward1.605 | 107 | |
| Text-to-image generation | Pick-a-Pic v2 (test) | PickScore92.9 | 42 | |
| Text-to-Image Alignment | Pick-a-Pic v2 | Image Reward1.2898 | 27 | |
| Text-to-Image Generation | Pick-a-Pic (test) | PickScore87 | 26 | |
| Text-to-Image Generation | Pick-a-Pic 1K prompts v1 | ImageReward1.28 | 20 | |
| Text-to-Image Generation | Pick-a-Pic (val) | PickScore24.9 | 20 | |
| Text-to-Image Generation Evaluation | Pick-a-Pic unique v2 (val) | PickScore22.59 | 13 | |
| Text-to-Image Alignment | Pick-a-Pic (test) | Pick Score2,388 | 10 | |
| Affective image generation | Pick-a-Pic | HPSV231.08 | 9 | |
| Automatic preference evaluation | Pick-a-Pic v2 (test) | Aesthetic Score Median6.0372 | 9 | |
| Text-to-image alignment | Pick-a-Pic V1 (test) | PickScore23.14 | 8 | |
| Preference-conditioned image generation | Pick-a-Pic processed | FID189.34 | 7 | |
| Human preference prediction | Pick-a-Pic (test) | Accuracy70.5 | 7 | |
| Image Generation | Pick-a-Pic (test) | HPSv2.132.18 | 6 | |
| Text-to-Image Generation | Pick-a-Pic 10,000 prompts | Inference Time2.34 | 6 | |
| Style Transfer | Pick-a-Pic (test) | ||ΔG||F4.8742 | 6 | |
| Text-to-Image Generation | Pick-a-Pic V2 (val) | PickScore21.27 | 5 | |
| Text-to-Image Generation | Pick-a-Pic v2 | CLIP Score32.96 | 5 | |
| Preference Discrimination | Pick-a-Pic processed | Top-1 Acc57.61 | 4 | |
| Affective image editing | Pick-a-Pic | HPSV224.65 | 4 | |
| Text-to-Image Generation | Pick-a-Pic D3 | Text Alignment0.5413 | 4 | |
| Text-to-Image Selection | Pick-a-Pic | Win Rate85.1 | 4 | |
| Text-to-Image | Pick-a-Pic | HPSv232.31 | 3 | |
| Text-to-Image Generation | Pick-a-Pic LPO v1 (val) | HPSv20.276 | 2 | |
| Text-to-Image Generation | Pick-a-Pic SPIN-Diffusion v1 (val) | HPSv20.276 | 2 |