Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

SafeBench

Benchmarks

Task NameDataset NameSOTA ResultTrend
Jailbreak AttackSafeBench
ASR0
112
Jailbreak AttackSafeBench Tiny
ASR100
24
Jailbreak attackSafebench (test)
IA ASR92
20
Jailbreak AttackSafeBench
ADU Success Rate100
16
Multimodal Safety EvaluationSafeBench
FS ASR3.26
4
JailbreakingSafeBench evaluated on OpenAI-o1
FS34.8
1
Showing 6 of 6 rows