Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Multi-hop Question Answering on FRAMES (Accuracy)
Loading...
50
Accuracy
gemini-2.5-flash
11.832
21.741
31.65
41.559
Jan 26, 2026
Accuracy
Updated 3d ago
Evaluation Results
Method
Method
Links
Accuracy
gemini-2.5-flash
Training Data=-, Backb...
2026.01
50
gemini-2.0-flash
Training Data=-, Backb...
2026.01
45.9
SAGE
Training Data=SAGE, Ba...
2026.01
32.3
Search-R1
Training Data=NQ + Hot...
2026.01
26.2
Musique-trained Agent
Training Data=Musique,...
2026.01
25
SAGE
Training Data=SAGE, Ba...
2026.01
23.8
Musique-trained Agent
Training Data=Musique,...
2026.01
21.5
Search-R1
Training Data=NQ + Hot...
2026.01
13.3
Feedback
Search any
task
Search any
task