Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Question Answering on QA Benchmark Suite Aggregate
Loading...
0.331
Average Score
Search-R1++
0.04396
0.11848
0.193
0.26752
Feb 23, 2026
Average Score
Updated 1mo ago
Evaluation Results
Method
Method
Links
Average Score
Search-R1++
Base Model=Qwen2.5-3B,...
2026.02
0.331
Search-R1
Base Model=Qwen2.5-3B,...
2026.02
0.289
R1-base
Base Model=Qwen2.5-3B,...
2026.02
0.229
ReAct
Base Model=Qwen2.5-3B,...
2026.02
0.055
Feedback
Search any
task
Search any
task