Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Agentic Search on BC-VL
Loading...
48.6
Accuracy
Claude-4-Sonnet
9.704
19.802
29.9
39.998
Apr 15, 2026
Accuracy
Updated 2d ago
Evaluation Results
Method
Method
Links
Accuracy
Claude-4-Sonnet
Workflow=Agent Workflow
2026.04
48.6
GPT-5
Workflow=Direct Answer
2026.04
47.2
GPT-5
Workflow=Agent Workflow
2026.04
46.1
Gemini-2.5 Flash
Workflow=Agent Workflow
2026.04
44.6
POINTS-Seeker-8B
Workflow=Agentic Searc...
2026.04
44.4
Vision-DeepResearch-8B
Workflow=Agentic Searc...
2026.04
42.6
Skywork-R1V4-30B-A3B
Workflow=Agentic Searc...
2026.04
38.4
MM-DeepResearch-8B
Workflow=Agentic Searc...
2026.04
37.9
Gemini-2.5 Flash
Workflow=Direct Answer
2026.04
37.1
Qwen3-VL-8B-Thinking
Workflow=Agent Workflow
2026.04
37.1
Claude-4-Sonnet
Workflow=Direct Answer
2026.04
29.3
WebWatcher-32B
Workflow=Agentic Searc...
2026.04
27
Qwen3-VL-8B-Instruct
Workflow=Direct Answer
2026.04
25.1
WebWatcher-7B
Workflow=Agentic Searc...
2026.04
21.2
GPT-4o
Workflow=RAG Workflow
2026.04
13.4
Gemini-2.5 Flash
Workflow=RAG Workflow
2026.04
13
Qwen-2.5-VL-72B
Workflow=RAG Workflow
2026.04
11.5
Claude-3.7-Sonnet
Workflow=RAG Workflow
2026.04
11.2
Feedback
Search any
task
Search any
task