Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Search on BrowseComp-ZH (test)
Loading...
68.7
Accuracy
OpenAI-o3
26.684
37.592
48.5
59.408
May 5, 2026
Accuracy
Updated 28d ago
Evaluation Results
Method
Method
Links
Accuracy
OpenAI-o3
# Samples=?, Training=...
2026.05
68.7
Gemini-3-pro
# Samples=?, Training=...
2026.05
66.8
GLM-4.7-357B
# Samples=?, Training=...
2026.05
66.6
DeepSeek-V3.2-671B
# Samples=?, Training=...
2026.05
65
GPT-5-High
# Samples=?, Training=...
2026.05
63
OpenSeeker-v2-30B-SFT
# Samples=10.6 k, Trai...
2026.05
58.1
RedSearcher-30B
# Samples=?, Training=...
2026.05
49.8
GLM-4.6-357B
# Samples=?, Training=...
2026.05
49.5
DeepSeek-V3.1-671B
# Samples=?, Training=...
2026.05
49.2
Minimax-M2-230B
# Samples=?, Training=...
2026.05
48.5
OpenSeeker-v1-30B-SFT
# Samples=11.7 k, Trai...
2026.05
48.4
Tongyi DeepResearch
# Samples=?, Training=...
2026.05
46.7
WebSailor-V2-30B-RL
# Samples=?, Training=...
2026.05
44.1
OpenAI Deep Research
# Samples=?, Training=...
2026.05
42.9
Claude-4.5-Sonnet
# Samples=?, Training=...
2026.05
42.4
Claude-4-Opus
# Samples=?, Training=...
2026.05
37.4
WebSailor-V2-30B-SFT
# Samples=?, Training=...
2026.05
28.3
Feedback
Search any
task
Search any
task