Multi-hop grounding on CorpusQA 4M
Top score: 14.29 (QwenLong-L1.5-30B-A3B, Dec 15, 2025)
Updated 4d ago
Evaluation Results
| Method | Settings | Date | Score |
| --- | --- | --- | --- |
| QwenLong-L1.5-30B-A3B | Evaluation Framework=M... | 2025.12 | 14.29 |
| Qwen3-30B-A3B-Thinking-2507 | Evaluation Framework=M... | 2025.12 | 9.52 |
| MemAgent-14B | Evaluation Framework=M... | 2025.12 | 9.09 |