| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Question Answering | DeepSearchQA | Accuracy82.67 | 19 | |
| DeepSearchQA | DeepSearchQA | Accuracy76.67 | 19 | |
| Long-horizon agentic task | DeepSearchQA | Performance66 | 18 | |
| Reasoning | DeepSearchQA 2026 (test) | Score24.1 | 15 | |
| Reasoning | DeepSearchQA | Score24.1 | 15 | |
| Agentic Search | DeepSearchQA | Accuracy90 | 14 | |
| Deep Research | DeepSearchQA | Score80 | 9 | |
| Search-based Question Answering | DeepSearchQA | Pass64 | 4 |