Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Reasoning on BIG-Bench Extra Hard

37.8Score

Qwen3-30B-A3B-Inst-2507

13.890420.097726.30532.5123Feb 9, 2026
Updated 4d ago

Evaluation Results

MethodLinks
37.81
2026.02
35.773.17
2026.02
33.515.04
27.864.6
23.241
2026.02
18.27-
16.472.03
2026.02
15.781.66
2026.02
15.33.19
14.81-