Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Defending against gradient-based attacks on Llama3 GCG Attack (test)

9.61ASR

Ours-Fakecom-t

6.51427.41248.3169.208Nov 1, 2024
Updated 4d ago

Evaluation Results

MethodLinks
2024.11
9.61
2024.11
12.01
2024.11
13.94
2024.11
19.23
2024.11
19.23
2024.11
19.71
2024.11
24.51
2024.11
28.84
2024.11
40.38
2024.11
87.01