Harmful Content Detection on Standard Harmful Content Datasets Evasion Attack
[Chart: top score over time for the selected category. Shown: Phishing, where GAVEL leads with 96 (Jan 27, 2026). Selectable categories: Phishing, SQL Injection, Delusional, Anti-LGBTQ, Elections, Racism, Tax Authority, Romance, E-commerce.]
Updated 4d ago
Evaluation Results
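| Method | Phishing | SQL Injection | Delusional | Anti-LGBTQ | Elections | Racism | Tax Authority | Romance | E-commerce |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| GAVEL (2026.01) | 96 | 95 | 97 | 100 | 100 | 98 | 71 | 97 | 100 |
| GPT4, Rule definitions=true (2026.01) | 91 | 95 | 56 | 66 | 85 | 87 | 91 | 92 | 97 |
| GPT4, Rule definitions=false (2026.01) | 86 | 91 | 29 | 80 | 54 | 94 | 100 | 91 | 84 |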