Mathematical Reasoning on MathVerse (test)
[Chart: Accuracy over time. Best result: 44.88 by Qwen2.5-VL-7B + Cont. Reward, Nov 20, 2025. Updated 1mo ago.]
Evaluation Results
Method                            Details                     Date      Accuracy
Qwen2.5-VL-7B + Cont. Reward      reward_type=Continuous      2025.11   44.88
Vision-Zero                       external supervision=true   2025.11   43.86
Qwen2.5-VL-7B (Baseline)          -                           2025.11   43.78
Qwen2.5-VL-7B + Discrete Reward   reward_type=Discrete        2025.11   42.1