Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
General AI Assistant Reasoning on BrowseComp-zh (BC-zh)
Loading...
42.9
Pass@1 Accuracy
OPENAI DEEPRESEARCH
2.34
12.87
23.4
33.93
Mar 7, 2026
Pass@1 Accuracy
Updated 1mo ago
Evaluation Results
Method
Method
Links
Pass@1 Accuracy
OPENAI DEEPRESEARCH
Category=CLOSED-SOURCE
2026.03
42.9
MIRO-30B + WEDAS
Category=OPEN-SOURCE
2026.03
41
MIRO-30B + MIROFLOW
Category=OPEN-SOURCE
2026.03
34
WEBSAILOR-72B
Category=OPEN-SOURCE
2026.03
30.1
GPT-5-MINI + MIROFLOW
Category=OPEN-SOURCE
2026.03
28
DOUBAO-DEEPTHINK
Category=CLOSED-SOURCE
2026.03
26
WEBSAILOR-32B
Category=OPEN-SOURCE
2026.03
25.5
GPT-5-MINI + WEDAS
Category=OPEN-SOURCE
2026.03
25
WEBDANCER-QWQ-32B
Category=OPEN-SOURCE
2026.03
18
SEARCH-O1-32B
Category=OPEN-SOURCE
2026.03
17.9
ASEARCHER-WEB-32B
Category=OPEN-SOURCE
2026.03
15.6
O4-MINI
Category=DIRECT INFERENCE
2026.03
15.2
GPT-4.1
Category=DIRECT INFERENCE
2026.03
14.4
GROK-DEEPRESEARCH
Category=CLOSED-SOURCE
2026.03
12.9
QWQ-32B
Category=DIRECT INFERENCE
2026.03
10
WEBTHINKER-32B-RL
Category=OPEN-SOURCE
2026.03
7.3
QWEN-2.5-72B
Category=DIRECT INFERENCE
2026.03
7
GPT-4O
Category=DIRECT INFERENCE
2026.03
6.2
QWEN-2.5-32B
Category=DIRECT INFERENCE
2026.03
3.9
Feedback
Search any
task
Search any
task