Share your thoughts, 1 month free Claude Pro on usSee more

Misuse Detection on Misuse Categories Cybercrime (Phishing)

0.99AUC

RepBending

Updated 4mo ago

Evaluation Results

Method	Links
RepBending 2026.01		0.99	99	1
GAVEL 2026.01		0.99	97	0
Llama Guard 4 (Meta) 2026.01		0.98	99	0
Perspective (Google) 2026.01		0.96	58	0
Circuit Breakers 2026.01		0.89	89	0
CAST 2026.01		0.89	59	80
Activation Classifier 2026.01		0.89	82	35
Moderator (OpenAI) 2026.01		0.86	86	0
JBShield 2026.01		0.64	52	6