| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Question Answering | TQA (test) | AUROC90.2 | 90 | |
| Question Answering | TQA | Absolute Execution Time Overhead (s)0.173 | 90 | |
| Question Answering | TQA | PRR86.1 | 90 | |
| Question Answering | TQA | Accuracy92.3 | 74 | |
| Knowledge gap detection | TQA | Accuracy83.2 | 18 | |
| Question Answering | TQA poison @ Position 10, k=10 (test) | Robustness Accuracy71 | 15 | |
| Question Answering | TQA poison @ Position 1, k=10 (test) | Robustness Accuracy66.4 | 15 | |
| Inference Efficiency | TQA | Relative Execution Time Overhead0.05 | 12 | |
| Open-Domain Question Answering | TQA (test) | EM66.45 | 11 | |
| Information Retrieval | TQA (test) | Recall@578.3 | 8 | |
| Retrieval-Augmented Generation | TQA open | Accuracy46.24 | 8 | |
| Context Compression & QA | TQA (val) | EM59.7 | 6 |