Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Jailbreak Attack Success Rate on AdvBench LLaMA-3.1-70B
Loading...
39.7
ASR (SMO, GPT-4o)
HMNS
4.652
13.751
22.85
31.949
Apr 11, 2026
ASR (SMO, GPT-4o)
ASR (SMO, GPT-5)
ASR (DPP, GPT-4o)
ASR (DPP, GPT-5)
ASR (RPO, GPT-4o)
ASR (RPO, GPT-5)
ASR (PAR, GPT-4o)
ASR (PAR, GPT-5)
ASR (PAT, GPT-4o)
ASR (PAT, GPT-5)
ASR (SAF, GPT-4o)
ASR (SAF, GPT-5)
Average ASR (GPT-4o)
Average ASR (GPT-5)
Updated 5d ago
Evaluation Results
Method
Method
Links
ASR (SMO, GPT-4o)
ASR (SMO, GPT-5)
ASR (DPP, GPT-4o)
ASR (DPP, GPT-5)
ASR (RPO, GPT-4o)
ASR (RPO, GPT-5)
ASR (PAR, GPT-4o)
ASR (PAR, GPT-5)
ASR (PAT, GPT-4o)
ASR (PAT, GPT-5)
ASR (SAF, GPT-4o)
ASR (SAF, GPT-5)
Average ASR (GPT-4o)
Average ASR (GPT-5)
HMNS
2026.04
39.7
16.2
52.9
39.2
83
62.1
63.7
36.6
47.8
30
46.8
36.6
55.6
36.8
ArrAttack
2026.04
33.7
10.2
46.9
33.2
77
56.1
57.7
30.6
41.8
24
40.8
30.6
49.6
30.8
Tempest
2026.04
24
18
40
28
68
50
50
26
35
20
33
26
41.7
28
AutoDAN
2026.04
9
7
20
15
32
26
20
16
18
14
12
9
18.5
14.5
FITD
2026.04
6
3
8
5
15
10
10
6
8
5
7
5
9
5.7
Feedback
Search any
task
Search any
task