Share your thoughts, 1 month free Claude Pro on usSee more

Deep Information Retrieval and Research on xbench DeepSearch

77.8Avg@8

OpenAI-GPT-5-high

Updated 3mo ago

Evaluation Results

Method	Links
OpenAI-GPT-5-high 2025.11		77.8
MiroThinker-v1.0-72B 2025.11		77.8
Tongyi-DeepResearch-30B 2025.11		75
Minimax-M2 2025.11		72
DeepSeek-V3.1 2025.11		71
DeepSeek-V3.2 2025.11		71
MiroThinker-v1.0-30B 2025.11		70.6
GLM-4.6 2025.11		70
Kimi-Researcher 2025.11		69
OpenAI-o3 2025.11		67
Claude-4.5-Sonnet 2025.11		66
Claude-4-Sonnet 2025.11		64.6
DeepMiner-32B-RL 2025.11		62
Kimi-K2-0905 2025.11		61
MiroThinker-v1.0-8B 2025.11		60.6
WebExplorer-8B-RL 2025.11		53.7