Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Short-form Question Answering on Bamboogle (test)
Loading...
0.3489
EM
VSPO + PRS_short
0.102524
0.166487
0.23045
0.294413
Dec 8, 2025
EM
Updated 4d ago
Evaluation Results
Method
Method
Links
EM
VSPO + PRS_short
Backbone=Qwen2.5-3B-In...
2025.12
0.3489
GRPO + PRS_short
Backbone=Qwen2.5-3B-In...
2025.12
0.3333
PPO + PRS_short
Backbone=Qwen2.5-3B-In...
2025.12
0.3325
Untrained
Backbone=Qwen2.5-3B-In...
2025.12
0.1667
SFT
Backbone=Qwen2.5-3B-In...
2025.12
0.112
Feedback
Search any
task
Search any
task