Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

JailbreakLLMs

Benchmarks

Task NameDataset NameSOTA ResultTrend
Safety EvaluationJailbreakLLMs Orig.
Unsafe Rate0
19
Safety EvaluationJailbreakLLMs Noise
Unsafe Rate0.38
13
Safety EvaluationJailbreakLLMs Blank
Unsafe Rate0
13
Showing 3 of 3 rows