Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

I2P

Benchmarks

Task NameDataset NameSOTA ResultTrend
Explicit Content RemovalI2P
Armpits Count2
28
Safe Text-to-Image GenerationI2P
Inappropriate Probability4
23
Concept UnlearningI2P
I2P0.0011
17
Nudity ErasureI2P 1.0 (test)
ASR (UD Attack)90.27
16
Inappropriate Content ErasingI2P
I2P (%)17.8
14
Safe Text-to-Image GenerationI2P
ASR0.023
13
Erase EffectivenessI2P sexual 1.0 (test)
Total Erased Count1,070
13
Common RobustnessI2P
ASR0.7
12
Nudity UnlearningI2P
ESD71.83
11
Explicit Content UnlearningI2P
Armpits153
11
Explicit Content UnlearningI2P v1.5 (test)
Armpits162
10
NSFW Concept ErasureI2P 4,703 potentially unsafe prompts
Total Success Count605
10
Nudity DetectionI2P (test)
Common Detections Count406
10
Text-to-Image GenerationI2P
Harmful Rate0.123
9
Text-to-Image SafetyI2P
Harmful Rate10.2
9
Explicit Content ErasureI2P benchmark
NN Score0
9
Safe Generation RateI2P
GPT-4o Score83.88
9
Prompt-Image AlignmentI2P
CLIPScore0.8514
8
Concept ErasureI2P
Nudity Rate1.83
7
Concept AttackI2P Violence concept
FLUX.1 ASR91.06
6
Concept AttackI2P Nudity concept
FLUX.1 ASR100
6
Text-to-Image Inappropriateness EvaluationI2P
Prob Inappropriate (Hate)0.09
6
Concept ErasureI2P
ASR (%)2.4
5
Safe Image GenerationI2P
Sexual Violation Frequency15
5
Safety-Sensitive Text-to-Image GenerationI2P Overall
Inappropriate Probability10
5
Showing 25 of 34 rows