Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
General AI Assistant Reasoning on BrowseComp
Loading...
51.5
Pass@1 Accuracy
OPENAI DEEPRESEARCH
-1.54
12.23
26
39.77
Mar 7, 2026
Pass@1 Accuracy
Updated 1mo ago
Evaluation Results
Method
Method
Links
Pass@1 Accuracy
OPENAI DEEPRESEARCH
Category=CLOSED-SOURCE
2026.03
51.5
MIRO-30B + WEDAS
Category=OPEN-SOURCE
2026.03
26
GPT-5-MINI + WEDAS
Category=OPEN-SOURCE
2026.03
17
MIRO-30B + MIROFLOW
Category=OPEN-SOURCE
2026.03
17
GPT-5-MINI + MIROFLOW
Category=OPEN-SOURCE
2026.03
15
WEBSAILOR-72B
Category=OPEN-SOURCE
2026.03
12
WEBSAILOR-32B
Category=OPEN-SOURCE
2026.03
10.5
O4-MINI
Category=DIRECT INFERENCE
2026.03
6.1
ASEARCHER-WEB-32B
Category=OPEN-SOURCE
2026.03
5.2
WEBDANCER-QWQ-32B
Category=OPEN-SOURCE
2026.03
3.8
SEARCH-O1-32B
Category=OPEN-SOURCE
2026.03
2.8
WEBTHINKER-32B-RL
Category=OPEN-SOURCE
2026.03
2.8
GPT-4.1
Category=DIRECT INFERENCE
2026.03
1.5
QWEN-2.5-32B
Category=DIRECT INFERENCE
2026.03
0.6
QWEN-2.5-72B
Category=DIRECT INFERENCE
2026.03
0.6
GPT-4O
Category=DIRECT INFERENCE
2026.03
0.6
QWQ-32B
Category=DIRECT INFERENCE
2026.03
0.5
Feedback
Search any
task
Search any
task