| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Red-teaming | I2P Nudity prompts | Failure Rate (FR)0 | 48 | |
| Explicit Content Removal | I2P | Buttocks Count0 | 47 | |
| Nudity Erasure | I2P | Total Count743 | 44 | |
| Concept Erasure | I2P | Number of Exposed Body Parts2 | 30 | |
| Nudity Detection | I2P | Breast (F) Detections298 | 29 | |
| Safety Generalization | i2p (test) | Self-Harm Score85.31 | 24 | |
| Concept Erasure | I2P | I2P Success Rate69.6 | 23 | |
| Safe Text-to-Image Generation | I2P | Inappropriate Probability4 | 23 | |
| Broad-concept removal | I2P | Self-harm Removal Rate47.9 | 22 | |
| Concept Restoration | I2P Successful Generations | Aesthetic Score49.67 | 21 | |
| Concept Restoration | I2P (All Prompts) | Aesthetic Score0.5055 | 21 | |
| Adversarial Attack | I2P | Attack Success Rate (ASR) (NudeNet)99 | 21 | |
| Explicit Content Unlearning | I2P | Total Count838 | 21 | |
| Concept Unlearning | I2P | I2P0.0011 | 17 | |
| Nudity Erasure | I2P 1.0 (test) | ASR (UD Attack)90.27 | 16 | |
| Nudity unlearning | I2P | Armpits Count153 | 15 | |
| Inappropriate Content Erasing | I2P | I2P (%)17.8 | 14 | |
| Concept Erasure | I2P | ASR (%)2.4 | 14 | |
| Unlearning Nudity | I2P | Nudity Generation Rate6.3 | 13 | |
| Nudity erasure | I2P 1.5 (test) | Nudity Generation Rate2.8 | 13 | |
| Safe Text-to-Image Generation | I2P | ASR0.023 | 13 | |
| Safe Image Generation | I2P | Average Violation Frequency11 | 13 | |
| Erase Effectiveness | I2P sexual 1.0 (test) | Total Erased Count1,070 | 13 | |
| Explicit Content Erasure | I2P (931 prompts) | Exposed Body-Part Detections245 | 12 | |
| Safe Image Generation | I2P (test) | Q16 IP0.8 | 12 |