Malicious Prompt Detection on Llama-2 7B-Chat (GCG Attacks)
[Chart: Detection Accuracy by date (metric options: Detection Accuracy, ASR); one entry, JoPA, May 30, 2024]
Evaluation Results
Method | Detection Accuracy | ASR
JoPA; Backbone=Llama-2 (7B-C...; 2024.05 | 100 | 3