Single-Hop Question Answering on TriviaQA (out-of-domain)
[Chart: Accuracy over time. Top result: EvolveR, 63.4 accuracy, Feb 9, 2026.]
Evaluation Results
Method       Configuration              Date     Accuracy
EvolveR      Backbone=Qwen2.5-7B-In...  2026.02  63.4
SKILLRL      Backbone=Qwen2.5-7B-In...  2026.02  63.3
ZeroSearch   Backbone=Qwen2.5-7B-In...  2026.02  61.8
Search-R1    Backbone=Qwen2.5-7B-In...  2026.02  61
RAG          Backbone=Qwen2.5-7B-In...  2026.02  58.2
R1-Instruct  Backbone=Qwen2.5-7B-In...  2026.02  44.9
Search-o1    Backbone=Qwen2.5-7B-In...  2026.02  40.6
Qwen2.5      Backbone=Qwen2.5-7B-In...  2026.02  35.6
CoT          Backbone=Qwen2.5-7B-In...  2026.02  35.6