Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Malicious Prompt Detection on Llama-2 Prompt with Random Search 7B-Chat

91Detection Accuracy

JoPA

86.4588.7259193.275May 30, 2024
Updated 1mo ago

Evaluation Results

MethodLinks
2024.05
9190