Prompt injection detection

Benchmarks

Dataset Name	SOTA Method	Metric
PopUp attack (top visited websites)	WARD-0.8b	Accuracy99.98	40	2mo ago
AgentDyn	PI-Hunter	Source Precision83.1	30	1mo ago
AgentDojo	PI-Hunter	Source Precision92.1	30	1mo ago
SafeGuardPI	IBM Granite Guardian	F1 Score93	15	5mo ago
QualifirePI	Apriel Guard	F1 Score87	15	5mo ago
DeepsetPI	Apriel Guard	F1 Score71	15	5mo ago
SCOUT-450	Attention tracker	ASR (hid)0	13	1mo ago
EIA		Recall97.39	13	3mo ago
VPI-Bench	WebAgentGuard-8B	Recall87.58	13	3mo ago
NotInject	LlamaGuard 3	FPR0.29	12	1mo ago
MIPIAD aggregate over English and Bangla (test)	Hybrid (XLPID+TF-IDF)	Accuracy (Acc)89	11	2mo ago
WAInjectBench 1.0 (test)	GPT-4o-prompt	TPR (EIA)80	11	2mo ago
Summ	DataSentinel	Detection Rate (TPR/FPR)100	8	1mo ago
SD	DataSentinel	Detection Rate (TPR/FPR)100	8	1mo ago
SA	DataSentinel	Detection Rate (TPR/FPR)100	8	1mo ago
NLI	DataSentinel	Detection Rate (TPR/FPR)100	8	1mo ago
HD	DataSentinel	Detection Rate100	8	1mo ago
GC	DataSentinel	Detection Rate100	8	1mo ago
DSD	DataSentinel	Detection Rate (TPR/FPR)100	8	1mo ago
Web Direct Prompt Injection		FPR0	7	5mo ago
Teaching Direct Prompt Injection		FPR0	7	5mo ago
Media Direct Prompt Injection		FPR0	7	5mo ago
Shopping Direct Prompt Injection		FPR0	7	5mo ago
Messaging Direct Prompt Injection		FPR0	7	5mo ago
Language Direct Prompt Injection	AlignSentinel	FPR0	7	5mo ago

Showing 25 of 55 rows