Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
General Knowledge and Reasoning on BBH, MMLU, CMMLU, and C-Eval Suite
Loading...
59.48
BBH
Qwen3-Inst
24.9208
33.8929
42.865
51.8371
Nov 28, 2025
BBH
MMLU
CMMLU
C-Eval
Updated 4d ago
Evaluation Results
Method
Method
Links
BBH
MMLU
CMMLU
C-Eval
Qwen3-Inst
Architecture=Dense, To...
2025.11
59.48
63.05
60.84
62.7
HSA-UL-Inst
Architecture=MoE, Tota...
2025.11
57.25
61.34
64.06
62.86
Qwen3-Inst
Architecture=Dense, To...
2025.11
42.56
45.87
41.64
43.81
HSA-UL-Inst
Architecture=Dense, To...
2025.11
26.25
42.24
43.33
45.41
Feedback
Search any
task
Search any
task