Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

PS-Bench

Benchmarks

Task NameDataset NameSOTA ResultTrend
Safety EvaluationPS-Bench base setting (test)
ASR (Hate Speech)18
30
Safety EvaluationPS-Bench
ASR (Hate Speech)29.1
7
Showing 2 of 2 rows