Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Public Prompt Harmfulness Benchmarks

Benchmarks

Task NameDataset NameSOTA ResultTrend
Prompt Harmfulness ClassificationPublic Prompt Harmfulness Benchmarks (ToxicChat, OpenAI Moderation, AegisSafetyTest, SimpleSafetyTests, HarmBenchPrompt)
ToxiC Score73
7
Showing 1 of 1 rows