| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Financial Question Answering | FiQA | Accuracy82.7 | 85 | |
| RAG Leakage Attack | FIQA | CCL66.8 | 72 | |
| Retrieval Attack Defense | FiQA | ASR0 | 70 | |
| End-to-End Defense in RAG | FiQA | ASR0 | 63 | |
| Information Retrieval | FIQA BEIR (test) | nDCG@1093.19 | 44 | |
| Adversarial Attack on RAG | FiQA | SASR98.47 | 24 | |
| Information Retrieval | FIQA | MRR54 | 22 | |
| Information Retrieval | FiQA | NDCG@10 (Dense)0.479 | 21 | |
| Passage Reranking | FiQA BEIR | NDCG@1029.88 | 19 | |
| Information Retrieval | FiQA | Faithfulness79 | 18 | |
| Information Retrieval | FiQA | nDCG@1055.6 | 16 | |
| Information Retrieval | FiQA benign retrieval | Success Rate (SR)79 | 16 | |
| Privacy-utility tradeoff | FIQA | Leakage8.48 | 16 | |
| Document Retrieval | FiQA BEIR 2018 | Delta nDCG@101.36 | 15 | |
| Information Retrieval | FiQA 2018 (test) | NDCG@100.6197 | 14 | |
| Question Answering Retrieval | FiQA | nDCG@1038.6 | 9 | |
| Vector Linking | FiQA | Precision79.8 | 8 | |
| AI-generated text detection | FiQA | AUROC79.67 | 7 | |
| RAG Utility Evaluation | FiQA | Hit Rate49.4 | 7 | |
| Question Answering | FiQA | Answerability Rate100 | 6 | |
| Financial Question Answering | FIQA Synthetic NIID 1.0 (test) | Win Rate82.1 | 6 | |
| Financial Question Answering | FIQA Synthetic IID 1.0 (test) | Win Rate72.1 | 6 | |
| Adversarial Attack | FiQA 2018 | Contriever86.54 | 6 | |
| Information Retrieval | FiQA 2018 | Accuracy42.05 | 6 | |
| FAQ matching | FiQA | Top-1 Accuracy61.2 | 5 |