Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Adversarial Prompt Detection on 10,000 Adversarial Prompts
Loading...
83
Detection Rate
Our Framework
63.24
68.37
73.5
78.63
Mar 17, 2026
Detection Rate
False Positive Rate
Updated 1mo ago
Evaluation Results
Method
Method
Links
Detection Rate
False Positive Rate
Our Framework
Processing Time=15ms
2026.03
83
5
SecurityLingua
Processing Time=12ms
2026.03
80
7
PromptShield
Processing Time=20ms
2026.03
78
8
Commercial Moderation APIs
Processing Time=180-220ms
2026.03
67
11
Open-Source Detoxify
Processing Time=45ms
2026.03
64
9
Feedback
Search any
task
Search any
task