| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Long-context Question Answering | DetectiveQA-En | Accuracy75.5 | 32 | |
| Long-context Question Answering | DetectiveQA-Zh | Accuracy0.8417 | 32 | |
| Retrieval | DetectiveQA | Recall@332.22 | 8 | |
| Retrieval | DetectiveQA-ZH | R@346.8 | 6 | |
| Question Answering | DetectiveQA | Accuracy67.25 | 6 |