| Task Name | Dataset Name | SOTA Result | Trend |
|---|---|---|---|
| Question Answering | TQA | Accuracy: 73.8 | 34 |
| Question Answering | TQA poison @ Position 10, k=10 (test) | Robustness Accuracy: 71 | 15 |
| Question Answering | TQA poison @ Position 1, k=10 (test) | Robustness Accuracy: 66.4 | 15 |
| Open-Domain Question Answering | TQA (test) | EM: 66.45 | 11 |
| Information Retrieval | TQA (test) | Recall@5: 78.3 | 8 |
| Retrieval-Augmented Generation | TQA open | Accuracy: 46.24 | 8 |
| Context Compression & QA | TQA (val) | EM: 59.7 | 6 |