Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Single-Hop Question Answering on TriviaQA (out-of-domain)
Loading...
68
Accuracy
GEPO
34.304
43.052
51.8
60.548
Oct 30, 2025
Nov 16, 2025
Dec 3, 2025
Dec 20, 2025
Jan 6, 2026
Jan 23, 2026
Feb 9, 2026
Accuracy
Updated 5d ago
Evaluation Results
Method
Method
Links
Accuracy
GEPO
Type=RL Training, Mode...
2025.10
68
GEPO
Type=RL Training, Mode...
2025.10
65.2
GiGPO
Type=RL Training, Mode...
2025.10
64.7
EvolveR
Backbone=Qwen2.5-7B-In...
2026.02
63.4
SKILLRL
Backbone=Qwen2.5-7B-In...
2026.02
63.3
ZeroSearch
Backbone=Qwen2.5-7B-In...
2026.02
61.8
ZeroSearch
Type=RL Training, Mode...
2025.10
61.8
Search-R1
Backbone=Qwen2.5-7B-In...
2026.02
61
Search-R1
Type=RL Training, Mode...
2025.10
61
GiGPO
Type=RL Training, Mode...
2025.10
59.5
RAG
Backbone=Qwen2.5-7B-In...
2026.02
58.2
ZeroSearch
Type=RL Training, Mode...
2025.10
57.4
Search-R1
Type=RL Training, Mode...
2025.10
54.5
R1-Instruct
Type=RL Training, Mode...
2025.10
53.7
R1-Instruct
Backbone=Qwen2.5-7B-In...
2026.02
44.9
R1-Instruct
Type=RL Training, Mode...
2025.10
44.9
Search-o1
Backbone=Qwen2.5-7B-In...
2026.02
40.6
Qwen2.5
Backbone=Qwen2.5-7B-In...
2026.02
35.6
CoT
Backbone=Qwen2.5-7B-In...
2026.02
35.6
Feedback
Search any
task
Search any
task