Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Jailbreak Attack on Advbench Llama2-70B Guard 100 prompts Original

0ASR

No Attack

-3.821.8547.573.15Feb 24, 2024
Updated 4d ago

Evaluation Results

MethodLinks
2024.02
0
2024.02
0
2024.02
1
2024.02
2
2024.02
5
2024.02
11
2024.02
16
2024.02
21
2024.02
50
2024.02
55
2024.02
56
2024.02
59
2024.02
64
2024.02
73
2024.02
74
2024.02
74
2024.02
80
2024.02
86
2024.02
88
2024.02
92
2024.02
95