Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

HarmBench and AdvBench

Benchmarks

Task NameDataset NameSOTA ResultTrend
Jailbreak DefenseHarmBench and AdvBench (test)
GCG Score91.2
44
Generative AI Output SafetyHarmBench and AdvBench (test)
Safe Rate82.88
8
Showing 2 of 2 rows