Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Question Answering on Bamboo
Loading...
50.4
Accuracy
CoT
10.464
20.832
31.2
41.568
Feb 13, 2026
Accuracy
Updated 4d ago
Evaluation Results
Method
Method
Links
Accuracy
CoT
Backbone=Qwen2.5-14B-I...
2026.02
50.4
CDKC
Backbone=Qwen2.5-14B-I...
2026.02
44
GRPO
Backbone=Qwen2.5-14B-I...
2026.02
39.2
Vanilla SFT
Backbone=Qwen2.5-14B-I...
2026.02
36
CoT
Backbone=Qwen2.5-3B-In...
2026.02
32.8
Vanilla LLM
Backbone=Qwen2.5-14B-I...
2026.02
28.8
CDKC
Backbone=Qwen2.5-3B-In...
2026.02
26.4
RAG
Backbone=Qwen2.5-14B-I...
2026.02
25.6
GRPO
Backbone=Qwen2.5-3B-In...
2026.02
22.4
Vanilla SFT
Backbone=Qwen2.5-3B-In...
2026.02
20
CGKE
Backbone=Qwen2.5-14B-I...
2026.02
20
CGKE
Backbone=Qwen2.5-3B-In...
2026.02
17.6
RAG
Backbone=Qwen2.5-3B-In...
2026.02
13.6
Vanilla LLM
Backbone=Qwen2.5-3B-In...
2026.02
12
Feedback
Search any
task
Search any
task