Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

HarmfulQ

Benchmarks

Task NameDataset NameSOTA ResultTrend
Harmlessness evaluationHarmfulQ (test)
Harmlessness Fraction100
7
Safety EvaluationHarmfulQ
ASR1.5
6
JailbreakHarmfulQ
ASR18
3
Showing 3 of 3 rows