Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Malicious Prompt Detection on Llama-2 7B-Chat (GCG Attacks)

100Detection Accuracy

JoPA

9597.5100102.5May 30, 2024
Updated 4d ago

Evaluation Results

MethodLinks
2024.05
1003