Jailbreak Robustness on AutoDAN Adv single-turn attack
[Figure: ASR by date (y-axis: ASR; x-axis: date, Jun 1, 2026). Top entry: No Defense, ASR 81.]
Evaluation Results
Method       Model         Date       ASR
No Defense   Qwen2.5-7B    2026.06    81
No Defense   Llama3-8B     2026.06    75
PROACT       Qwen2.5-7B    2026.06    0
SAGE         Qwen2.5-7B    2026.06    0
THRD         Qwen2.5-7B    2026.06    0
PROACT       Llama3-8B     2026.06    0
SAGE         Llama3-8B     2026.06    0
THRD         Llama3-8B     2026.06    0