Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Deep Research on Deep Research tasks (test)
Loading...
3.7
Interest Level
Agentic Reasoning
1.1
1.775
2.45
3.125
Feb 7, 2025
Interest Level
Organization
Relevance
Coverage
Updated 1mo ago
Evaluation Results
Method
Method
Links
Interest Level
Organization
Relevance
Coverage
Agentic Reasoning
underlying reasoning m...
2025.02
3.7
4.6
4.2
4.1
Gemini-DR+
2025.02
3.2
2.5
2.3
3
STORM
2025.02
2.9
3.2
2.9
3.7
Search-O1
2025.02
2.5
2.8
2.1
3.2
RAgent
2025.02
1.6
2.3
1.6
2.6
RAG
2025.02
1.4
2.1
1.9
2.3
Direct Gen
2025.02
1.2
1.6
1.2
1.7
Feedback
Search any
task
Search any
task