Multi-hop Question Answering on MMhops Comparison (test)
[Chart: Accuracy by date. Highlighted point: Gemini-2.5-pro, 29.39 Accuracy, Dec 15, 2025.]
Updated 4d ago
Evaluation Results
| Method           | Configuration             | Date    | Accuracy |
|------------------|---------------------------|---------|----------|
| Gemini-2.5-pro   | -                         | 2025.12 | 29.39    |
| Gemini-2.5-flash | -                         | 2025.12 | 23.18    |
| MMhops-R1        | Base Model=Qwen2.5-vl-... | 2025.12 | 22.01    |
| Self-Ask         | Base Model=GPT-4o, Ret... | 2025.12 | 18.27    |
| OmniSearch       | Base Model=GPT-4o, Ret... | 2025.12 | 17.02    |
| Vanilla mRAG     | Base Model=Qwen2.5-vl-... | 2025.12 | 9.72     |
| GPT-4o           | -                         | 2025.12 | 8.76     |
| Zero-shot        | Base Model=Qwen2.5-vl-... | 2025.12 | 7.59     |
| GPT-4o-mini      | -                         | 2025.12 | 7.05     |
| Search-r1        | Base Model=Qwen2.5-7b-... | 2025.12 | 6.62     |
| Zero-shot        | Base Model=Qwen2.5-vl-... | 2025.12 | 6.2      |
| EchoSight        | Base Model=LLaMA3, Ret... | 2025.12 | 4.81     |
| Vanilla mRAG     | Base Model=Qwen2.5-vl-... | 2025.12 | 3.95     |