Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Contextual Reasoning on BigBenchHard
Loading...
63.23
EM
Sigma-MoE-Tiny Base
40.5996
46.4748
52.35
58.2252
Dec 18, 2025
EM
Updated 4d ago
Evaluation Results
Method
Method
Links
EM
Sigma-MoE-Tiny Base
# Shots=3-shot, Archit...
2025.12
63.23
Gemma-3 4B Base
# Shots=3-shot, Archit...
2025.12
51.7
DeepSeek-V2 Lite
# Shots=3-shot, Archit...
2025.12
44.1
Qwen3 0.6B Base
# Shots=3-shot, Archit...
2025.12
41.47
Feedback
Search any
task
Search any
task