| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Long-context Question Answering | DetectiveQA-En | Accuracy75.5 | 38 | |
| Long-context Question Answering | DetectiveQA-Zh | Accuracy80 | 38 | |
| logical reasoning | DetectiveQA | Accuracy (DetectiveQA)88.31 | 24 | |
| Story Question Answering | DetectiveQA | Accuracy82.3 | 12 | |
| Retrieval | DetectiveQA | Recall@332.22 | 8 | |
| Retrieval | DetectiveQA-ZH | R@346.8 | 6 | |
| Question Answering | DetectiveQA | Accuracy67.25 | 6 |