Share your thoughts, 1 month free Claude Pro on us
See more
Feedback
Search any
task
Search any
task
SOTA Prompt injection detection benchmarks and papers with code | Wizwand
Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Tasks
Prompt injection detection
Benchmarks
Dataset Name
SOTA Method
Dataset Name
SOTA Method
Metric
Trend
Results
Last Updated
SafeGuardPI
IBM Granite Guardian
F1 Score
93
15
1mo ago
QualifirePI
Apriel Guard
F1 Score
87
15
1mo ago
DeepsetPI
Apriel Guard
F1 Score
71
15
1mo ago
PopUp attack (top visited websites)
WebAgentGuard-8B
Accuracy
91.13
13
4d ago
EIA
GPT-4.1
Recall
97.39
13
4d ago
VPI-Bench
WebAgentGuard-8B
Recall
87.58
13
4d ago
Web Direct Prompt Injection
Prompt-Guard2
FPR
0
7
1mo ago
Teaching Direct Prompt Injection
Prompt-Guard2
FPR
0
7
1mo ago
Media Direct Prompt Injection
Prompt-Guard2
FPR
0
7
1mo ago
Shopping Direct Prompt Injection
Prompt-Guard2
FPR
0
7
1mo ago
Messaging Direct Prompt Injection
Prompt-Guard2
FPR
0
7
1mo ago
Language Direct Prompt Injection
AlignSentinel
FPR
0
7
1mo ago
Entertainment Direct Prompt Injection
AlignSentinel
FPR
0
7
1mo ago
Coding Direct Prompt Injection
Prompt-Guard2
FPR
0
7
1mo ago
AlignSentinel Evaluation Dataset (Indirect Prompt Injection Attack)
Prompt-Guard2
FPR (Coding)
0
7
1mo ago
OpenPromptInjection
Enc-first
Naive FPR
0
6
1mo ago
IHEval Tool-use
AlignSentinel (Enc-first)
FPR
0
6
1mo ago
IHEval Rule-following
AlignSentinel (Enc-first)
FPR
0.01
6
1mo ago
NQ simplified
PromptArmor
Naïve RL Score
6
5
1mo ago
AgentDojo
RedVisorLlama
Banking RL
97
5
1mo ago
NeuralExec (test)
CONTEXTCITE
Detection Accuracy (Top-1)
98.8
3
1mo ago
Showing 21 of 21 rows
25 / page
50 / page
100 / page
1
Search any
task
Search any
task
Privacy Policy
Terms of Service
FAQs
Swarm Docs