Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

JBB-Behaviors

Benchmarks

Task NameDataset NameSOTA ResultTrend
Jailbreak DefenseJBB-Behaviors
ASR0
101
Jailbreak AttackJBB-Behaviors
Rule-Judge Score100
56
Jailbreak RobustnessJBB-Behaviors (test)
ASR0
24
Robustness against priming vulnerabilityJBB-Behaviors (test)
ASR (Guardrail Model)0
20
Jailbreak Attack RobustnessJBB-Behaviors
ASR (PAIR)10
18
Jailbreak RobustnessJBB-Behaviors
ASR (PAIR, Guardrail Model)0.3
18
Safety EvaluationJBB-Behaviors
Safety Score99.3
9
Showing 7 of 7 rows