Share your thoughts, 1 month free Claude Pro on usSee more

Web Navigation Agentic Reasoning on BrowseComp complete (test)

88.2Avg Success Rate@3

MiroThinker-H1

Updated 4mo ago

Evaluation Results

Method	Links
MiroThinker-H1 2026.03		88.2
Gemini-3.1-Pro 2026.03		85.9
Claude-4.6-Opus 2026.03		84
OpenAI-GPT-5.4 2026.03		82.7
Qwen3.5-397B 2026.03		78.6
Kimi-K2.5 2026.03		78.4
Seed-2.0-Pro 2026.03		77.3
Minimax-M2.5 2026.03		76.3
GLM-5.0 2026.03		75.9
MiroThinker-1.7 2026.03		74
MiroThinker-1.7-mini 2026.03		67.9
Claude-4.5-Opus 2026.03		67.8
DeepSeek-V3.2 2026.03		67.6
Gemini-3.0-Pro 2026.03		59.2
OpenAI-GPT-5 2026.03		54.9
Tongyi-DeepResearch-30B 2026.03		43.4