LLM Behavior

Benchmarks

Task Name	Dataset Name	SOTA Result
Indirect Prompt Injection	LLM Behavior Subset 1	IR99.8	24
LLM Behavior	LLM Behavior	Response Rate (RR)95	12
Prompt Injection Attack Success	LLM Behavior	Injection Rate (IR)100	10
Indirect Prompt Injection Attack Success Evaluation	LLM Behavior Goal-Distant	IRany100	5
Indirect Prompt Injection Attack Success Evaluation	LLM Behavior Goal-Adjacent	IRany100	5

Showing 5 of 5 rows