Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Deep Research on Xbench DeepResearch
Loading...
67
Accuracy
OpenAI o4-mini
37.88
45.44
53
60.56
Feb 2, 2026
Feb 9, 2026
Feb 16, 2026
Feb 23, 2026
Mar 2, 2026
Mar 9, 2026
Mar 17, 2026
Accuracy
Updated 25d ago
Evaluation Results
Method
Method
Links
Accuracy
OpenAI o4-mini
type=Foundation Models...
2026.03
67
OPENRESEARCHER
2026.03
65
Claude-4-Sonnet
type=Foundation Models...
2026.03
64
DeepSeek-R1
type=Foundation Models...
2026.03
55
Nemotron-3-Nano
type=Foundation Models...
2026.03
55
WebSailor-72B
type=Deep Research Agents
2026.03
55
DeepMiner-32B
type=Deep Research Agents
2026.03
53
Kimi-K2
type=Foundation Models...
2026.03
50
Live-Evo
Base Model=GPT-4.1-mini
2026.02
46
MiroFlow
Base Model=GPT-4.1-mini
2026.02
45
Qwen-DeepResearch
Base Model=GPT-4.1-mini
2026.02
43
ASearcher-QwQ-32B
type=Deep Research Agents
2026.03
42
ReMem
Base Model=GPT-4.1-mini
2026.02
40
WebDancer-QwQ-32B
type=Deep Research Agents
2026.03
39
Feedback
Search any
task
Search any
task