| Task Name | Dataset Name | SOTA Result | Trend |
|---|---|---|---|
| Indirect Prompt Injection | LLM Behavior Subset 1 | IR: 99.8 | 24 |
| LLM Behavior | LLM Behavior | Response Rate (RR): 95 | 12 |
| Prompt Injection Attack Success | LLM Behavior | Injection Rate (IR): 100 | 10 |
| Indirect Prompt Injection Attack Success Evaluation | LLM Behavior Goal-Distant | IRany: 100 | 5 |
| Indirect Prompt Injection Attack Success Evaluation | LLM Behavior Goal-Adjacent | IRany: 100 | 5 |