SOTA Malicious Prompt Detection on Llama-2 7B-Chat (GCG Attacks) | PapersWithCode

Detection Accuracy: 100