Deep research on BrowseComp-zh
[Chart: Accuracy over time. Best result: 66.6 (GLM-4.7-358B), Feb 2, 2026]
Evaluation Results
| Method | Model Category | Date | Accuracy (%) |
|---|---|---|---|
| GLM-4.7-358B | Large O... | 2026.02 | 66.6 |
| DeepSeek-V3.2-Thinking-685B | Large O... | 2026.02 | 65 |
| GPT-5-high | Closed-... | 2026.02 | 63 |
| Kimi-K2-Thinking-1T | Large O... | 2026.02 | 62.3 |
| o3 | Closed-... | 2026.02 | 58.1 |
| RE-TRAC-30B-A3B | Interme... | 2026.02 | 57.3 |
| Gemini-3-pro | Closed-... | 2026.02 | 51.6 |
| MiniMax-M2-229B | Large O... | 2026.02 | 48.5 |
| Tongyi-DeepResearch-30B-A3B | Interme... | 2026.02 | 46.7 |
| IterResearch-30B-A3B | Interme... | 2026.02 | 45.2 |
| WebSailor-V2-30B-A3B (RL) | Interme... | 2026.02 | 44.1 |
| OpenAI DeepResearch | Closed-... | 2026.02 | 42.9 |
| Claude-4.5-Sonnet | Closed-... | 2026.02 | 42.4 |
| RE-TRAC-4B | Compact... | 2026.02 | 36.1 |
| WebExplorer-8B | Compact... | 2026.02 | 32 |
| InfoAgent-14B | Compact... | 2026.02 | 29.2 |
| AgentCPM-Explore-4B | Compact... | 2026.02 | 29 |
| NestBrowse-4B | Compact... | 2026.02 | 28.4 |