Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Misuse Categories

Benchmarks

Task NameDataset NameSOTA ResultTrend
Misuse DetectionMisuse Categories Aggregate Summary
AUC99
9
Misuse DetectionMisuse Categories Automation (e-commerce)
AUC100
9
Misuse DetectionMisuse Categories Scam (Romance)
AUC100
9
Misuse DetectionMisuse Categories Scam (Tax Authority)
AUC99
9
Misuse DetectionMisuse Categories Scam (Racism)
AUC1
9
Misuse DetectionMisuse Categories Scam (Elections)
AUC0.99
9
Misuse DetectionMisuse Categories Psychological Harm (Anti-LGBTQ)
AUC100
9
Misuse DetectionMisuse Categories Psychological Harm (Delusional)
AUC99
9
Misuse DetectionMisuse Categories Cybercrime (SQL Injection)
AUC99
9
Misuse DetectionMisuse Categories Cybercrime (Phishing)
AUC0.99
9
Showing 10 of 10 rows