Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Deep Research Evaluation on Aggregate
Loading...
63.97
WQ
Smolagents Open DR
61.63
62.2375
62.845
63.4525
Feb 21, 2026
WQ
Factuality
CI
DA
KIC
RQ
Updated 1mo ago
Evaluation Results
Method
Method
Links
WQ
Factuality
CI
DA
KIC
RQ
Smolagents Open DR
2026.02
63.97
58.15
4.78
17.96
75.95
69.16
LangChain Open DR
2026.02
62.17
44.64
15.92
85.08
65.96
57.28
Tongyi Deep Research
2026.02
61.72
55.09
1.03
3.38
57.95
45.48
Feedback
Search any
task
Search any
task