Jailbreak Attack Success Rate on AdvBench LLaMA-2-7B-Chat
[Chart: ASR (SMO, GPT-4o) over time, y-axis from 8.8 to 40. HMNS is plotted at 40 on Apr 11, 2026. Selectable metrics: ASR for SMO, DPP, RPO, PAR, PAT, SAF, and Avg, each with GPT-4o and GPT-5.]
Evaluation Results
| Method | Date | ASR (SMO, GPT-4o) | ASR (SMO, GPT-5) | ASR (DPP, GPT-4o) | ASR (DPP, GPT-5) | ASR (RPO, GPT-4o) | ASR (RPO, GPT-5) | ASR (PAR, GPT-4o) | ASR (PAR, GPT-5) | ASR (PAT, GPT-4o) | ASR (PAT, GPT-5) | ASR (SAF, GPT-4o) | ASR (SAF, GPT-5) | ASR (Avg, GPT-4o) | ASR (Avg, GPT-5) |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| HMNS | 2026.04 | 40 | 25 | 54 | 41 | 82 | 61 | 64 | 45 | 48 | 33 | 47 | 34 | 55.8 | 39.8 |
| ArrAttack | 2026.04 | 34 | 22 | 48 | 36 | 74 | 55 | 58 | 41 | 42 | 30 | 40 | 29 | 49.3 | 35.5 |
| Tempest | 2026.04 | 26 | 19 | 42 | 31 | 68 | 50 | 52 | 38 | 36 | 25 | 33 | 24 | 42.8 | 31.2 |
| AutoDAN | 2026.04 | 15 | 11 | 24 | 18 | 38 | 28 | 28 | 20 | 22 | 16 | 18 | 12 | 24.2 | 17.5 |
| FITD | 2026.04 | 10 | 7 | 12 | 9 | 20 | 14 | 14 | 10 | 12 | 8 | 11 | 7 | 13.2 | 9.2 |
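The two Avg columns look like the arithmetic mean of the six settings (SMO, DPP, RPO, PAR, PAT, SAF) for each method. The page does not say so; that is an assumption, and the sketch below only checks it against the reported numbers. The names `ROWS`, `REPORTED_AVG`, `mean_asr`, and `check_averages` are illustrative and do not come from the source.

```python
# ASR values copied from the results table, in the order
# SMO, DPP, RPO, PAR, PAT, SAF. Names here are illustrative only.
ROWS = {
    "HMNS":      {"GPT-4o": [40, 54, 82, 64, 48, 47], "GPT-5": [25, 41, 61, 45, 33, 34]},
    "ArrAttack": {"GPT-4o": [34, 48, 74, 58, 42, 40], "GPT-5": [22, 36, 55, 41, 30, 29]},
    "Tempest":   {"GPT-4o": [26, 42, 68, 52, 36, 33], "GPT-5": [19, 31, 50, 38, 25, 24]},
    "AutoDAN":   {"GPT-4o": [15, 24, 38, 28, 22, 18], "GPT-5": [11, 18, 28, 20, 16, 12]},
    "FITD":      {"GPT-4o": [10, 12, 20, 14, 12, 11], "GPT-5": [7, 9, 14, 10, 8, 7]},
}

# Avg columns as reported in the table.
REPORTED_AVG = {
    "HMNS":      {"GPT-4o": 55.8, "GPT-5": 39.8},
    "ArrAttack": {"GPT-4o": 49.3, "GPT-5": 35.5},
    "Tempest":   {"GPT-4o": 42.8, "GPT-5": 31.2},
    "AutoDAN":   {"GPT-4o": 24.2, "GPT-5": 17.5},
    "FITD":      {"GPT-4o": 13.2, "GPT-5": 9.2},
}


def mean_asr(values):
    """Arithmetic mean of a list of ASR values."""
    return sum(values) / len(values)


def check_averages(tol=0.05):
    """Return (method, label, computed, reported) for every mismatch.

    Assumption: Avg is the unweighted mean of the six settings,
    rounded to one decimal, so a difference of up to 0.05 is allowed.
    """
    mismatches = []
    for method, by_label in ROWS.items():
        for label, values in by_label.items():
            computed = mean_asr(values)
            reported = REPORTED_AVG[method][label]
            if abs(computed - reported) > tol:
                mismatches.append((method, label, computed, reported))
    return mismatches


if __name__ == "__main__":
    problems = check_averages()
    print("all averages consistent" if not problems else problems)
```

Under that assumption every reported average agrees with the per-setting values to within rounding, so the Avg columns can be recomputed if a row is added or corrected.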