Expert-Level Reasoning on XBench-DeepSearch 1.0 (test)
[Chart: Inference Accuracy over time. Highlighted point: ReThinker, 0.9, Feb 4, 2026]
Updated 3mo ago
Evaluation Results
Method               Model Category   Date      Inference Accuracy
ReThinker            Inferen...       2026.02   0.9
Gemini-3-Pro         Foundat...       2026.02   0.87
ReThinker            Inferen...       2026.02   0.78
GPT-5-high           Foundat...       2026.02   0.778
Tongyi DeepResearch  Inferen...       2026.02   0.75
DeepSeek-V3.2        Foundat...       2026.02   0.71
MiroThinker-v1.0     Inferen...       2026.02   0.706
GLM-4.6              Foundat...       2026.02   0.7
Kimi Researcher      Inferen...       2026.02   0.69
Claude-4.5-Sonnet    Foundat...       2026.02   0.66
WebExplorer          Inferen...       2026.02   0.537
Kimi K2              Foundat...       2026.02   0.5