Single-Hop Search-augmented Question Answering on NQ (success rate (%))
[Chart: Success Rate (%) by date. Plotted point: + MASA, 37, May 29, 2026.]
Evaluation Results
Method         Backbone     Date      Success Rate (%)
+ MASA         Qwen3-32B    2026.05   37
+ MASA         Qwen3-8B     2026.05   36.4
+ MASA         Qwen3-14B    2026.05   35.6
+ MASA         Qwen3-4B     2026.05   35.5
+ Base Skill   Qwen3-14B    2026.05   35.3
+ Base Skill   Qwen3-4B     2026.05   34.5
+ DS-Adapter   Qwen3-32B    2026.05   34.4
+ Base Skill   Qwen3-8B     2026.05   34
+ DS-Adapter   Qwen3-14B    2026.05   33.9
No Skill       Qwen3-14B    2026.05   33.8
+ Base Skill   Qwen3-32B    2026.05   33.8
+ DS-Adapter   Qwen3-8B     2026.05   33.2
+ DS-Adapter   Qwen3-4B     2026.05   33
No Skill       Qwen3-4B     2026.05   29.4
No Skill       Qwen3-32B    2026.05   29.1
No Skill       Qwen3-8B     2026.05   19.1
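
The entries above are ranked by raw success rate, which makes it hard to see how much each method adds on a given backbone. A minimal sketch of that comparison follows, using only the scores listed on this page. The names `RESULTS` and `gain_over_baseline` are hypothetical helpers introduced for illustration and are not part of the benchmark; treating "No Skill" as the baseline is an assumption about how the rows relate.

```python
# Success rates (%) copied from the leaderboard entries above.
# Grouping by backbone is an illustrative choice, not the page's own layout.
RESULTS = {
    "Qwen3-4B": {"No Skill": 29.4, "+ Base Skill": 34.5, "+ DS-Adapter": 33.0, "+ MASA": 35.5},
    "Qwen3-8B": {"No Skill": 19.1, "+ Base Skill": 34.0, "+ DS-Adapter": 33.2, "+ MASA": 36.4},
    "Qwen3-14B": {"No Skill": 33.8, "+ Base Skill": 35.3, "+ DS-Adapter": 33.9, "+ MASA": 35.6},
    "Qwen3-32B": {"No Skill": 29.1, "+ Base Skill": 33.8, "+ DS-Adapter": 34.4, "+ MASA": 37.0},
}


def gain_over_baseline(results, method, baseline="No Skill"):
    """Return {backbone: method score minus baseline score} in percentage points."""
    return {
        backbone: round(scores[method] - scores[baseline], 1)
        for backbone, scores in results.items()
    }


if __name__ == "__main__":
    # Print the per-backbone gain of each method over the assumed baseline.
    for method in ("+ Base Skill", "+ DS-Adapter", "+ MASA"):
        print(method, gain_over_baseline(RESULTS, method))
```

The differences are in percentage points of success rate and say nothing about statistical significance, since the page reports a single number per entry.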