Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Multi-hop Grounding on CorpusQA 1M
Loading...
53.11
Score
Gemini-2.5-Pro
0.7356
14.3328
27.93
41.5272
Dec 15, 2025
Score
Updated 4d ago
Evaluation Results
Method
Method
Links
Score
Gemini-2.5-Pro
Evaluation Framework=F...
2025.12
53.11
Gemini-2.5-Flash-Thinking
Evaluation Framework=F...
2025.12
36.91
QwenLong-L1.5-30B-A3B
Evaluation Framework=M...
2025.12
20.72
Qwen3-30B-A3B-Thinking-2507
Evaluation Framework=M...
2025.12
15.32
MemAgent-14B
Evaluation Framework=M...
2025.12
9.7
Qwen-Flash-Thinking-1M
Evaluation Framework=F...
2025.12
2.75
Feedback
Search any
task
Search any
task