Prompt Injection Detection on NQ simplified
[Chart: Naïve RL Score. Highlighted point: PromptArmor, 6 (Feb 2, 2026)]
Updated 4d ago
Evaluation Results
| Method | Date | Naïve RL Score | Naïve ES Score | Ignore RL Score | Ignore ES Score | Escape RL Score | Escape ES Score | Composite RL Score | Composite ES Score | Multi RL Score | Multi ES Score |
|---|---|---|---|---|---|---|---|---|---|---|---|
| PromptArmor | 2026.02 | 6 | 8 | 19 | 24 | 7 | 10 | 2 | 4 | 1 | 3 |
| RedVisor (Backbone=Mistral-7B) | 2026.02 | 89 | 91 | 92 | 94 | 90 | 91 | 74 | 75 | 71 | 71 |
| PromptLocate | 2026.02 | 91 | 93 | 94 | 95 | 92 | 93 | 68 | 65 | 62 | 59 |
| RedVisor (Backbone=Qwen-2.5-7B) | 2026.02 | 92 | 94 | 95 | 96 | 92 | 94 | 79 | 80 | 77 | 76 |
| RedVisor (Backbone=Llama-3-8B) | 2026.02 | 93 | 95 | 95 | 97 | 93 | 94 | 80 | 78 | 78 | 75 |