Misuse Detection on Misuse Categories Cybercrime (Phishing)
[Chart: RepBending, AUC 0.99 (Jan 27, 2026). Metrics: AUC, b-ACC, FPR.]
Evaluation Results
| Method | Details | Links | AUC | b-ACC (%) | FPR (%) |
| --- | --- | --- | --- | --- | --- |
| RepBending | Category=Fine-Tuning,... | 2026.01 | 0.99 | 99 | 1 |
| GAVEL | Category=Classifier, B... | 2026.01 | 0.99 | 97 | 0 |
| Llama Guard 4 (Meta) | Category=Moderation, B... | 2026.01 | 0.98 | 99 | 0 |
| Perspective (Google) | Category=Moderation, B... | 2026.01 | 0.96 | 58 | 0 |
| Circuit Breakers | Category=Fine-Tuning,... | 2026.01 | 0.89 | 89 | 0 |
| CAST | Category=Inference-Tim... | 2026.01 | 0.89 | 59 | 80 |
| Activation Classifier | Category=Classifier, B... | 2026.01 | 0.89 | 82 | 35 |
| Moderator (OpenAI) | Category=Moderation, B... | 2026.01 | 0.86 | 86 | 0 |
| JBShield | Category=Inference-Tim... | 2026.01 | 0.64 | 52 | 6 |
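
The results above report AUC, b-ACC, and FPR without saying how each was computed. This is a minimal Python sketch of how the three metrics relate, assuming the standard definitions: AUC as the probability that a random positive outscores a random negative, and b-ACC as the mean of TPR and TNR, with TNR = 1 - FPR. The page does not state its decision threshold or evaluation protocol, so the function names (`auc`, `threshold_metrics`, `implied_tpr`) and the toy data are hypothetical and for illustration only.

```python
from typing import Dict, Sequence


def auc(labels: Sequence[int], scores: Sequence[float]) -> float:
    """Rank-based AUC: chance that a random positive outscores a random
    negative, with ties counted as half a win. Threshold-free."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative example")
    wins = sum(
        1.0 if p > n else 0.5 if p == n else 0.0
        for p in pos
        for n in neg
    )
    return wins / (len(pos) * len(neg))


def threshold_metrics(
    labels: Sequence[int], scores: Sequence[float], threshold: float
) -> Dict[str, float]:
    """TPR, FPR, and balanced accuracy at one fixed decision threshold."""
    tp = fn = fp = tn = 0
    for y, s in zip(labels, scores):
        flagged = s >= threshold
        if y == 1 and flagged:
            tp += 1
        elif y == 1:
            fn += 1
        elif flagged:
            fp += 1
        else:
            tn += 1
    if tp + fn == 0 or fp + tn == 0:
        raise ValueError("need at least one positive and one negative example")
    tpr = tp / (tp + fn)
    fpr = fp / (fp + tn)
    return {"tpr": tpr, "fpr": fpr, "b_acc": (tpr + (1.0 - fpr)) / 2.0}


def implied_tpr(b_acc_pct: float, fpr_pct: float) -> float:
    """TPR (in percent) implied by b-ACC and FPR, ASSUMING
    b-ACC = (TPR + TNR) / 2 and TNR = 100 - FPR."""
    return 2.0 * b_acc_pct - (100.0 - fpr_pct)


# Hypothetical toy data: 1 = misuse (phishing), 0 = benign.
toy_labels = [1, 1, 1, 1, 0, 0, 0, 0]
toy_scores = [0.9, 0.8, 0.4, 0.3, 0.35, 0.2, 0.1, 0.05]

if __name__ == "__main__":
    print(auc(toy_labels, toy_scores))
    print(threshold_metrics(toy_labels, toy_scores, threshold=0.5))
    print(implied_tpr(58, 0))   # Perspective (Google) row
    print(implied_tpr(59, 80))  # CAST row
```

On the toy data the ranking is nearly perfect (AUC 0.9375), yet at a threshold of 0.5 half of the positives are missed, giving a b-ACC of 0.75 with an FPR of 0. A high AUC next to a low b-ACC, as in the Perspective (Google) row, is therefore not a contradiction. Under the same assumptions, that row's b-ACC of 58 and FPR of 0 would imply a TPR of roughly 16%, while CAST's b-ACC of 59 and FPR of 80 would imply a TPR of roughly 98%. The listed values look rounded, so these figures are approximate.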