Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

NSFW

Benchmarks

Task NameDataset NameSOTA ResultTrend
NSFW Concept GenerationNSFW-200 Violence v2.1 (test)
ASR-166
70
NSFW Concept GenerationNSFW-200 Sex v2.1 (test)
ASR-162
70
Concept Unlearning PreservationNSFW
CSDR12.76
12
Adversarial RobustnessNSFW
ASR4.69
11
Harmful prompt detectionNSFW56k
Accuracy99
6
NSFW DetectionNSFW56k
Acceptance Rate (ASR)97
5
NSFW Safety EvaluationNSFW
Metric-
0
Showing 7 of 7 rows