Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Multi-hop Question Answering on NQ Large-scale (train)
Loading...
1.3
Avg Search Steps
NQ
1.235
1.2675
1.3
1.3325
Jan 26, 2026
Avg Search Steps
Avg Recall@8
Updated 1mo ago
Evaluation Results
Method
Method
Links
Avg Search Steps
Avg Recall@8
NQ
Annotation=Human
2026.01
1.3
83.1
Feedback
Search any
task
Search any
task