Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
General Intelligence Evaluation on AGI Eval English
Loading...
92.2
Score
Qwen 3 VL 32B Think
85.648
87.349
89.05
90.751
Dec 15, 2025
Score
Updated 4d ago
Evaluation Results
Method
Method
Links
Score
Qwen 3 VL 32B Think
Model Family=Qwen 3, P...
2025.12
92.2
Qwen 3 32B
Model Family=Qwen 3, P...
2025.12
90
K2-V2 70B Instruct
Model Family=K2, Param...
2025.12
89.6
Olmo 3.1 Think 32B
Training Stage=Final T...
2025.12
88.8
Olmo 3 Think (Final 3.0)
Training Stage=Final T...
2025.12
88.2
DS-R1 32B
Model Family=DeepSeek-...
2025.12
88.1
Olmo 3 Think (DPO)
Training Stage=DPO, Mo...
2025.12
87.8
Olmo 3 Think (SFT)
Training Stage=SFT, Mo...
2025.12
85.9
Feedback
Search any
task
Search any
task