Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Deep Search on BrowseComp-ZH
Loading...
63.7
Accuracy
TaS
21.892
32.746
43.6
54.454
Feb 6, 2026
Accuracy
Updated 4d ago
Evaluation Results
Method
Method
Links
Accuracy
TaS
Type=MA, Backbone=GPT-...
2026.02
63.7
GPT-5 High-Think
Type=N/A
2026.02
63
GPT-5 Medium-Think
Type=MA
2026.02
62.9
GPT-5 Medium-Think
Type=SA
2026.02
56.5
MiroThinker-v1.0-72B
Type=SA
2026.02
55.6
MiroThinker-v1.0-30B
Type=SA
2026.02
47.8
Tongyi DeepResearch (30B)
Type=SA
2026.02
46.7
OpenAI Deep Research
Type=N/A
2026.02
42.9
MiroThinker-v1.0-8B
Type=SA
2026.02
40.2
TaS
Type=MA, Backbone=Qwen...
2026.02
35.3
TaS
Type=MA, Backbone=Gemi...
2026.02
34.9
Qwen3-Max
Type=MA
2026.02
34.3
Claude-4-Sonnet (Thinking)
Type=SA
2026.02
29.1
Gemini-2.5-Flash
Type=MA
2026.02
28.4
Gemini-2.5-Pro
Type=SA
2026.02
27.8
Gemini-2.5-Flash
Type=SA
2026.02
26.6
Qwen3-Max
Type=SA
2026.02
23.5
Feedback
Search any
task
Search any
task