| Task Name | Dataset Name | SOTA Result | Trend |
|---|---|---|---|
| Question Answering | HQA | EM 0.386 | 28 |
| Knowledge-Intensive Reasoning | HQA | Average Score 87 | 18 |
| Question Answering | HQA (val) | EM 35.2 | 14 |
| Question Answering | HQA (in-domain) | EM 39.6 | 14 |
| Information Retrieval | HQA (test) | Recall@5 57.7 | 7 |
| Context Compression & QA | HQA (val) | EM 30.4 | 6 |