Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

PS-Bench

Benchmarks

Task NameDataset NameSOTA ResultTrend
Safety EvaluationPS-Bench base setting (test)
ASR (Hate Speech)18
30
Safety EvaluationPS-Bench
ASR (Hate Speech)29.1
7
Showing 2 of 2 rows