Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Language Understanding on CEval
Loading...
63.03
Accuracy
Qwen2.5
28.034
37.1195
46.205
55.2905
Dec 25, 2025
Dec 31, 2025
Jan 7, 2026
Jan 13, 2026
Jan 20, 2026
Jan 26, 2026
Feb 2, 2026
Accuracy
Updated 4d ago
Evaluation Results
Method
Method
Links
Accuracy
Qwen2.5
2025.12
63.03
Qwen3
2025.12
59.16
Qwen3-4B
2026.02
50.81
PretrainRL
Backbone=Qwen3-4B
2026.02
49.06
Gamayun
2025.12
44.81
Llama3.2
2025.12
41.74
Gemma3
2025.12
34.83
SFT
Backbone=Qwen3-4B
2026.02
29.38
Feedback
Search any
task
Search any
task