| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| End-to-end Question Answering | MultiHopRAG (test val) | Accuracy47.14 | 20 | |
| Multi-session Retrieval-Augmented Generation | MultihopRAG (test) | F1 Score64.4 | 12 | |
| Multi-hop Reasoning | MultiHopRAG | EM89.6 | 11 | |
| Information Retrieval | MultiHopRAG (test) | MRR@1063.58 | 11 | |
| Query-relevant Extraction | MultiHopRAG | F1 Score32 | 8 | |
| Main Content Extraction | MultiHopRAG | F1 Score87.4 | 8 |