Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Deep Research on DeepSearchQA
Loading...
80
Score
Claude-4.5-Opus
15.312
32.106
48.9
65.694
Apr 21, 2026
Score
Updated 1mo ago
Evaluation Results
Method
Method
Links
Score
Claude-4.5-Opus
Model Category=Foundat...
2026.04
80
GPT-5 High
Model Category=Foundat...
2026.04
79
Kimi-K2.5
Model Category=Foundat...
2026.04
77.1
Gemini-3-Pro
Model Category=Foundat...
2026.04
76.9
DeepSeek-V3.2
Model Category=Foundat...
2026.04
60.9
DR-Venus-4B-RL
Model Category=Trained...
2026.04
39.6
DR-Venus-4B-SFT
Model Category=Trained...
2026.04
37.7
AgentCPM-Explore-4B
Model Category=Trained...
2026.04
32.8
WebExplorer-8B-RL
Model Category=Trained...
2026.04
17.8
Feedback
Search any
task
Search any
task