| Dataset Name | SOTA Method | Metric | Value | Count | Last Updated |
|---|---|---|---|---|---|
| DeepResearch Bench | | Comprehensiveness | 52.84 | 81 | 5d ago |
| PDR-Bench | IntentRL clarify | P-Score | 7.21 | 22 | 3mo ago |
| Rigorous Bench | CollabLLM clarify | Quality | 0.6257 | 22 | 3mo ago |
| QRC-Eval | Mind2Report | Relevance | 75.42 | 12 | 3mo ago |
| MMR Bench+ | ViDR (GPT-5.2) | Informativeness | 4.15 | 9 | 20d ago |
| DeepResearchGym Commercial 100 | | KPR | 80.55 | 9 | 3mo ago |
| Top-down setting 1.0 (test) | Nomad | Numeric Grounding | 66.3 | 3 | 2mo ago |