Share your thoughts, 1 month free Claude Pro on us
See more
Feedback
Search any
task
Search any
task
SOTA Agentic Search benchmarks and papers with code | Wizwand
Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Tasks
Agentic Search
Benchmarks
Dataset Name
SOTA Method
Dataset Name
SOTA Method
Metric
Trend
Results
Last Updated
Agentic Search Benchmarks Average
PDA-GAM
Average Token-level F1
58
30
1d ago
MMSearch v1.0 (test)
POINTS-Seeker-8B
Accuracy
70.8
21
1mo ago
Search Unseen
RESKILL
PopQA Accuracy
52.3
19
1d ago
Search Seen
RESKILL
NQ Accuracy
51.6
19
1d ago
Xbench DeepSearch 2505
LiteResearcher-4B
Accuracy
78
18
1mo ago
BC-VL
Claude-4-Sonnet
Accuracy
48.6
18
1mo ago
BrowseComp
DeepSeek-V3.2
Accuracy
67.6
16
1mo ago
GAIA Search
Weighted Voting
Accuracy
96.9
14
1d ago
xbench DeepSearch
Solution Aggregation
Accuracy
61
14
1d ago
DeepSearchQA
FINEVERIFY
Accuracy
90
14
1d ago
BrowseComp-Plus
Solution Aggregation
Accuracy
66.5
14
1d ago
BrowseComp
Seed-2.0-Pro
Score
77.3
14
27d ago
LongSeAL
Vanilla-Qwen3-32B
String-F1
13.5
14
1mo ago
Frames
Vanilla-Qwen3-32B
String-F1
36.6
14
1mo ago
BrowseComp
Vanilla-Qwen3-32B
String-F1
21
14
1mo ago
Bamboogle
Vanilla-Qwen3-32B
String-F1 Score
73.1
14
1mo ago
2WikiMultiHopQA
Vanilla-Qwen3-32B
String-F1
69.9
14
1mo ago
HotpotQA
Vanilla-Qwen3-32B
String-F1
0.603
14
1mo ago
MuSiQue
Vanilla-Qwen3-32B
String F1 Score
32.5
14
1mo ago
GAIA text
MiroThinker-1.7-mini
Score
80.3
12
27d ago
Web Dancer
Laser
LJFT
49.24
12
3mo ago
Bamboogle
Laser
LJFT Score
64.8
12
3mo ago
BrowseComp-ZH (test)
Laser
LJFT
21.45
12
3mo ago
MMSearch+
POINTS-Seeker-8B
Accuracy
25.2
10
1mo ago
WebWalker
LiteResearcher-4B
Accuracy
72.7
9
1mo ago
Showing 25 of 38 rows
25 / page
50 / page
100 / page
1
2
Search any
task
Search any
task
Privacy Policy
Terms of Service
FAQs
Swarm Docs