Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Chinese-language ability on CMATH
Loading...
84.8
Accuracy
GLM-4.6V-FlashX
61.088
67.244
73.4
79.556
Jan 29, 2026
Accuracy
Updated 4d ago
Evaluation Results
Method
Method
Links
Accuracy
GLM-4.6V-FlashX
2026.01
84.8
Ovis2.5
Parameter Scale=9B
2026.01
83.4
Qwen2.5-VL
Parameter Scale=72B
2026.01
74.8
Qwen3-VL
Parameter Scale=8B
2026.01
66.2
Ostrakon-VL
Parameter Scale=8B
2026.01
63.5
InternVL3.5
Parameter Scale=8B
2026.01
62
Feedback
Search any
task
Search any
task