| Task Name | Dataset Name | Metric | SOTA Result | Trend |
|---|---|---|---|---|
| Comparative Performance Evaluation | DeepConsult | Win Rate | 77.21 | 24 |
| Open-Ended Deep Research | DeepConsult | Win Rate | 64.42 | 9 |
| Deep Research | DeepConsult (test) | Win Rate | 80 | 8 |
| Multimodal Report Generation | DeepConsult | Instruction Adherence Score | 13.73 | 7 |
| Agentic Writing | DeepConsult | Pass@1 | 57.2 | 5 |