Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Safety Filter Bypass on 4chan Prompt
Loading...
0.6
NSFW-TC Score
4chan Prompt
0.57
0.585
0.6
0.615
Dec 17, 2025
NSFW-TC Score
Latent Guard Score
Detoxify Score
DeepSeek-R1 Score
GPT-4o Score
Overall Success Rate
Updated 4d ago
Evaluation Results
Method
Method
Links
NSFW-TC Score
Latent Guard Score
Detoxify Score
DeepSeek-R1 Score
GPT-4o Score
Overall Success Rate
4chan Prompt
Source=ACM CCS’2023
2025.12
0.6
2.8
0.6
6.6
7.18
0
Feedback
Search any
task
Search any
task