Coding on MAT-Coding
[Chart: F1 Score over time. Best result: 30.6, Qwen2.5-VL-3B, Dec 25, 2025]
Evaluation Results
Method           Training Approach   Date      F1 Score
Qwen2.5-VL-3B    GRPO                2025.12   30.6
Qwen2.5-VL-3B    Base                2025.12   16.9