Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Misuse Detection on Misuse Categories Automation (e-commerce)
Loading...
100
AUC
Activation Classifier
6.4
30.7
55
79.3
Jan 27, 2026
AUC
Balanced Accuracy
FPR
Updated 4d ago
Evaluation Results
Method
Method
Links
AUC
Balanced Accuracy
FPR
Activation Classifier
Category=Classifier, B...
2026.01
100
89
0
GAVEL
Category=Classifier, B...
2026.01
99
95
0
RepBending
Category=Fine-Tuning,...
2026.01
97
98
2
CAST
Category=Inference-Tim...
2026.01
94
52
96
Llama Guard 4 (Meta)
Category=Moderation, B...
2026.01
91
94
4
Perspective (Google)
Category=Moderation, B...
2026.01
70
59
0
Circuit Breakers
Category=Fine-Tuning,...
2026.01
50
50
0
Moderator (OpenAI)
Category=Moderation, B...
2026.01
50
50
0
JBShield
Category=Inference-Tim...
2026.01
10
48
5
Feedback
Search any
task
Search any
task