
I2P

Benchmarks

| Task Name | Dataset Name | SOTA Result | Trend |
|---|---|---|---|
| Red-teaming | I2P Nudity prompts | Failure Rate (FR): 0 | 48 |
| Explicit Content Removal | I2P | Buttocks Count: 0 | 47 |
| Nudity Erasure | I2P | Total Count: 743 | 44 |
| Concept Erasure | I2P | Number of Exposed Body Parts: 2 | 30 |
| Nudity Detection | I2P | Breast (F) Detections: 298 | 29 |
| Safety Generalization | i2p (test) | Self-Harm Score: 85.31 | 24 |
| Concept Erasure | I2P | I2P Success Rate: 69.6 | 23 |
| Safe Text-to-Image Generation | I2P | Inappropriate Probability: 4 | 23 |
| Broad-concept removal | I2P | Self-harm Removal Rate: 47.9 | 22 |
| Concept Restoration | I2P Successful Generations | Aesthetic Score: 49.67 | 21 |
| Concept Restoration | I2P (All Prompts) | Aesthetic Score: 0.5055 | 21 |
| Adversarial Attack | I2P | Attack Success Rate (ASR) (NudeNet): 99 | 21 |
| Explicit Content Unlearning | I2P | Total Count: 838 | 21 |
| Concept Unlearning | I2P | I2P: 0.0011 | 17 |
| Nudity Erasure | I2P 1.0 (test) | ASR (UD Attack): 90.27 | 16 |
| Nudity unlearning | I2P | Armpits Count: 153 | 15 |
| Inappropriate Content Erasing | I2P | I2P (%): 17.8 | 14 |
| Concept Erasure | I2P | ASR (%): 2.4 | 14 |
| Unlearning Nudity | I2P | Nudity Generation Rate: 6.3 | 13 |
| Nudity erasure | I2P 1.5 (test) | Nudity Generation Rate: 2.8 | 13 |
| Safe Text-to-Image Generation | I2P | ASR: 0.023 | 13 |
| Safe Image Generation | I2P | Average Violation Frequency: 11 | 13 |
| Erase Effectiveness | I2P sexual 1.0 (test) | Total Erased Count: 1,070 | 13 |
| Explicit Content Erasure | I2P (931 prompts) | Exposed Body-Part Detections: 245 | 12 |
| Safe Image Generation | I2P (test) | Q16 IP: 0.8 | 12 |
Showing 25 of 71 rows