Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Prompt Injection Detection on NQ simplified
Loading...
6
Naïve RL Score
PromptArmor
2.52
26.01
49.5
72.99
Feb 2, 2026
Naïve RL Score
Naïve ES Score
Ignore RL Score
Ignore ES Score
Escape RL Score
Escape ES Score
Composite RL Score
Composite ES Score
Multi RL Score
Multi ES Score
Updated 1mo ago
Evaluation Results
Method
Method
Links
Naïve RL Score
Naïve ES Score
Ignore RL Score
Ignore ES Score
Escape RL Score
Escape ES Score
Composite RL Score
Composite ES Score
Multi RL Score
Multi ES Score
PromptArmor
2026.02
6
8
19
24
7
10
2
4
1
3
RedVisor
Backbone=Mistral-7B
2026.02
89
91
92
94
90
91
74
75
71
71
PromptLocate
2026.02
91
93
94
95
92
93
68
65
62
59
RedVisor
Backbone=Qwen-2.5-7B
2026.02
92
94
95
96
92
94
79
80
77
76
RedVisor
Backbone=Llama-3-8B
2026.02
93
95
95
97
93
94
80
78
78
75
Feedback
Search any
task
Search any
task