Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Malicious Prompt Detection on Llama-2 7B-Chat (GCG Attacks)
Loading...
100
Detection Accuracy
JoPA
95
97.5
100
102.5
May 30, 2024
Detection Accuracy
ASR
Updated 1mo ago
Evaluation Results
Method
Method
Links
Detection Accuracy
ASR
JoPA
Backbone=Llama-2 (7B-C...
2024.05
100
3
Feedback
Search any
task
Search any
task