Multimodal Mathematical Reasoning on WeMath mini (test)

79.5 Accuracy

Qwen3-VL-4B-Instruct-Math-RL Teacher

[Chart: results over time, Jun 8, 2025 to May 5, 2026]
Updated 28d ago

Evaluation Results

Method    Links
79.5    2025.06
72.6    2025.06
72      2025.06
71.9    2025.06
68.8    2025.06
66.3    2025.06
66.3    2025.06
65.6    2026.05
65      2025.06
64.8    2026.05
64.8    2025.06
61.9    2025.06
61.4    2026.05
58.7    2026.05
57.6    2025.06
53.5
48.6    2025.06
42.3