SOTA Prompt Injection Attack benchmarks and papers with code | Wizwand
Prompt Injection Attack
Benchmarks
| Dataset Name | SOTA Method | Metric | Trend | Results | Last Updated |
| --- | --- | --- | --- | --- | --- |
| InjecAgent | Direct | ASR @ 1 Attempt | 0 | 32 | 2mo ago |
| Direct Scenario | StruQ | ASR | 2.88 | 28 | 1mo ago |
| SQuAD v2 | PAIR | ASR | 0 | 27 | 1mo ago |
| AgentDojo | PISmith | ASR@1 | 64 | 21 | 8d ago |
| Tool-Completion (TCA) | CAHL | ASR | 0.12 | 14 | 3mo ago |
| NavGPT (test) | None | Navigation Error | 7.07 | 12 | 3mo ago |
| GovReport | Heuristic | Attack Success Rate (ASR) | 0 | 11 | 1mo ago |
| NarrativeQA | nanoGCG-OPT | ASR | 86 | 11 | 1mo ago |
| MuSiQue | nanoGCG-OPT | Attack Success Rate (ASR) | 98 | 9 | 1mo ago |
| AgentDojo Slack suite | AgentDojo Static Injection | Baseline ASR | 14.4 | 9 | 3mo ago |
| Tool-Completion Naive-e | CAHL | ASR | 15 | 7 | 3mo ago |
| Tool-Completion TCA-e | CAHL | ASR | 56 | 7 | 3mo ago |
| Musique | nanoGCG-OPT | ASR | 92 | 6 | 1mo ago |
| AgentDojo 13 non-agent benchmarks | TAP | Training Queries | 0 | 6 | 2mo ago |
| Outdoor Navigation (test) | None | NE | 0 | 6 | 3mo ago |
| SQuAD | Vanilla | Attack Success Rate (ASR) | 10.32 | 4 | 22d ago |
| EHRAgent | nanoGCG-OPT | ASR | 100 | 4 | 1mo ago |
| GovReport | nanoGCG | ASR | 100 | 4 | 1mo ago |
| Long Code Arena (LCA) project-level code completion 16K token contexts first 50 repositories medium context set | nanoGCG-OPT | Attack Success Rate (ASR) | 80 | 4 | 1mo ago |
| Real-world Overall (test) | PI3D | ASR | 64.8 | 2 | 3mo ago |
| Real-world Outdoor (test) | PI3D | ASR | 58.3 | 2 | 3mo ago |
| Office Real-world (test) | PI3D | ASR | 54.2 | 2 | 3mo ago |
| Real-world Home (test) | PI3D | ASR | 88.3 | 2 | 3mo ago |
Showing 23 of 23 rows