General Question Answering on TriviaQA (test) (F1 score)
[Chart: F1 over time. Top result: Search-R1 + EKA, 66.1 F1 (Dec 23, 2025)]
Updated 4d ago
Evaluation Results

Method              Details                     Date     F1
Search-R1 + EKA     Backbone=Qwen2.5-7B-In...   2025.12  66.1
Search-R1           Backbone=Qwen2.5-7B-In...   2025.12  61
Rejection Sampling  Backbone=Qwen2.5-7B-In...   2025.12  59.2
Standard RAG        Backbone=Qwen2.5-7B-In...   2025.12  58.5
R1-base             Backbone=Qwen2.5-7B-In...   2025.12  53.9
R1-instruct         Backbone=Qwen2.5-7B-In...   2025.12  53.7
IRCoT               Backbone=Qwen2.5-7B-In...   2025.12  47.8
Search-o1           Backbone=Qwen2.5-7B-In...   2025.12  44.3
Direct Inference    Backbone=Qwen2.5-7B-In...   2025.12  40.8
SFT                 Backbone=Qwen2.5-7B-In...   2025.12  35.4
CoT                 Backbone=Qwen2.5-7B-In...   2025.12  18.5
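
For readers unfamiliar with the metric, the following is a minimal sketch of token-level F1 as it is commonly computed for open-domain QA (SQuAD-style answer normalization, best score over the reference aliases). This page does not state which F1 variant the listed results use, so the normalization rules and the function names (normalize_answer, token_f1, best_f1) are assumptions made for illustration, not the benchmark's official scorer.

```python
import re
import string
from collections import Counter


def normalize_answer(text: str) -> list[str]:
    """Lowercase, drop punctuation and English articles, split on whitespace.

    Assumption: SQuAD-style normalization; the leaderboard may use another rule.
    """
    text = text.lower()
    text = "".join(ch for ch in text if ch not in string.punctuation)
    text = re.sub(r"\b(a|an|the)\b", " ", text)
    return text.split()


def token_f1(prediction: str, reference: str) -> float:
    """Harmonic mean of token precision and recall for one reference answer."""
    pred_tokens = normalize_answer(prediction)
    ref_tokens = normalize_answer(reference)
    if not pred_tokens or not ref_tokens:
        # Both empty counts as a match; exactly one empty counts as a miss.
        return float(pred_tokens == ref_tokens)
    # Multiset intersection counts each shared token at most min(count) times.
    overlap = sum((Counter(pred_tokens) & Counter(ref_tokens)).values())
    if overlap == 0:
        return 0.0
    precision = overlap / len(pred_tokens)
    recall = overlap / len(ref_tokens)
    return 2 * precision * recall / (precision + recall)


def best_f1(prediction: str, references: list[str]) -> float:
    """Score against every accepted alias and keep the best match."""
    return max(token_f1(prediction, ref) for ref in references)
```

A dataset-level score in the style of the F1 column would then be the mean of best_f1 over all questions, multiplied by 100. That aggregation step is also an assumption here.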