Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Question Answering on QA Benchmark Suite Aggregate

0.331Average Score

Search-R1++

0.043960.118480.1930.26752Feb 23, 2026
Updated 1mo ago

Evaluation Results

MethodLinks
2026.02
0.331
2026.02
0.289
2026.02
0.229
2026.02
0.055