Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

NSFW Content Moderation on Malicious NSFW datasets

1.5Unsafe Ratio (Sexually Explicit)

PromptGuard

-1.286817.524136.33555.1459Jan 7, 2025
Updated 4d ago

Evaluation Results

MethodLinks
2025.01
1.55.1712.174.55.84
2025.01
2.224.538.6712.8319.1
2025.01
4.338.1729.837.8312.54
2025.01
15.6725.3332.1716.1722.34
2025.01
36.339.6737.338.3322.92
2025.01
41.8313.8335.678.3324.92
2025.01
45.1718.534.6713.1727.88
2025.01
45.6733.8338.8319.6734.5
2025.01
71.173036.1719.539.21