Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Language Understanding on MMLU CF
Loading...
69.8
Score
GHS-TDA
64.912
66.181
67.45
68.719
Feb 10, 2026
Score
Updated 4d ago
Evaluation Results
Method
Method
Links
Score
GHS-TDA
Backbone=Qwen 2-14B
2026.02
69.8
AoT
Backbone=Qwen 2-14B
2026.02
68.9
GoT
Backbone=Qwen 2-14B
2026.02
67.8
ToT
Backbone=Qwen 2-14B
2026.02
67.6
GHS-TDA
Backbone=Llama 3-8B
2026.02
67.3
AoT
Backbone=Llama 3-8B
2026.02
66.65
GoT
Backbone=Llama 3-8B
2026.02
65.99
ToT
Backbone=Llama 3-8B
2026.02
65.71
CoT
Backbone=Llama 3-8B
2026.02
65.42
CoT
Backbone=Qwen 2-14B
2026.02
65.1
Feedback
Search any
task
Search any
task