Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Question Answering on SeaQA
Loading...
86.03
Accuracy
CDKC
33.406
47.068
60.73
74.392
Feb 13, 2026
Accuracy
Updated 4d ago
Evaluation Results
Method
Method
Links
Accuracy
CDKC
Backbone=Qwen2.5-14B-I...
2026.02
86.03
GRPO
Backbone=Qwen2.5-14B-I...
2026.02
85
CDKC
Backbone=Qwen2.5-3B-In...
2026.02
71.33
GRPO
Backbone=Qwen2.5-3B-In...
2026.02
69.43
CoT
Backbone=Qwen2.5-14B-I...
2026.02
67.63
CGKE
Backbone=Qwen2.5-14B-I...
2026.02
66.43
Vanilla SFT
Backbone=Qwen2.5-14B-I...
2026.02
65.97
Vanilla LLM
Backbone=Qwen2.5-14B-I...
2026.02
59.67
RAG
Backbone=Qwen2.5-14B-I...
2026.02
57.4
RAG
Backbone=Qwen2.5-3B-In...
2026.02
45.37
Vanilla SFT
Backbone=Qwen2.5-3B-In...
2026.02
44.53
CGKE
Backbone=Qwen2.5-3B-In...
2026.02
43.17
CoT
Backbone=Qwen2.5-3B-In...
2026.02
40.47
Vanilla LLM
Backbone=Qwen2.5-3B-In...
2026.02
35.43
Feedback
Search any
task
Search any
task