Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Jailbreak Attack on Llama2-7b five finetuned variants
Loading...
0
Average ASR
PIF
-2.504
14.398
31.3
48.202
Dec 14, 2025
Average ASR
Updated 1mo ago
Evaluation Results
Method
Method
Links
Average ASR
PIF
Attacker knowledge (Th...
2025.12
0
DAN
Attacker knowledge (Th...
2025.12
0
PAIR
Attacker knowledge (Th...
2025.12
0
L4A
Attacker knowledge (Th...
2025.12
1.8
DI-GCG
Attacker knowledge (Th...
2025.12
3.2
AutoDan (adaptation)
Attacker knowledge (Th...
2025.12
6.2
AutoDan (white)
Attacker knowledge (Th...
2025.12
15.8
LSGM_LILA (adaptation)
Attacker knowledge (Th...
2025.12
16
SEA
Attacker knowledge (Th...
2025.12
20.4
TUJA (adaptation)
Attacker knowledge (Th...
2025.12
24.8
GCG (adaptation)
Attacker knowledge (Th...
2025.12
25.6
SCAV (adaptation)
Attacker knowledge (Th...
2025.12
28
Guiding-GCG
Attacker knowledge (Th...
2025.12
33.2
GCG Ensemble
Attacker knowledge (Th...
2025.12
40.2
GCG (white)
Attacker knowledge (Th...
2025.12
47.4
PGP
Attacker knowledge (Th...
2025.12
62.6
Feedback
Search any
task
Search any
task