| Task Name | Dataset Name | SOTA Result | Trend |
|---|---|---|---|
| Defense against Indirect Prompt Injection | Filtered QA dataset | ASR (Naive): 97.65 | 30 |
| Question Answering | QA dataset Reverse direction | Exact Match Accuracy: 87 | 2 |
| Question Answering | QA dataset Same direction | Exact Match Accuracy: 100 | 2 |