Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Jailbreak on AdvBench RPO defense (full)
Loading...
92
ASR
TAO-Attack
85.76
87.38
89
90.62
Mar 3, 2026
ASR
Iterations
Updated 1mo ago
Evaluation Results
Method
Method
Links
ASR
Iterations
TAO-Attack
Target Model=Llama-2-7...
2026.03
92
71
I-GCG
Target Model=Llama-2-7...
2026.03
86
133
Feedback
Search any
task
Search any
task