General Language Performance on Aggregate Suite
[Chart: Average Score over time. Top result: Qwen3-4B, 78.03, Dec 8, 2025.]
Evaluation Results
Method                   Params   Date      Average Score
Qwen3-4B                 4B       2025.12   78.03
Qwen2.5-3B               3B       2025.12   71.54
Qwen3-1.7B               1.7B     2025.12   68.44
Qwen2.5-1.5B             1.5B     2025.12   65.57
SmolLM3-3B               3B       2025.12   64.61
llama-3.2-3B             3B       2025.12   62.31
YuLan-Mini-2.4B          2.4B     2025.12   61.7
Qwen2-1.5B               1.5B     2025.12   61.08
PCMind-2.1-Kaiyuan-2B    2B       2025.12   59.07
Qwen3-0.6B               0.6B     2025.12   57.11
gemma2-2B                2B       2025.12   53.26
SmolLM2-1.7B             1.7B     2025.12   51.89
OLMo-2-0425-1B           1B       2025.12   48.6
llama-3.2-1B             1B       2025.12   47.25