Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Multimodal Math Reasoning on R1-OV Bench
Loading...
38.9
Accuracy
AVATAR
34.532
35.666
36.8
37.934
Aug 5, 2025
Accuracy
Updated 19d ago
Evaluation Results
Method
Method
Links
Accuracy
AVATAR
backbone=Qwen2.5-VL-7B
2025.08
38.9
AVATAR
backbone=Qwen2.5-VL-7B...
2025.08
38.5
VLAAThinker-VL-7B
2025.08
38.4
AVATAR
backbone=Qwen2.5-VL-7B...
2025.08
37.5
AVATAR
backbone=Qwen2.5-VL-7B...
2025.08
37
Qwen2.5-VL-7B + GRPO
training=GRPO
2025.08
36.8
Qwen2.5-VL-7B
status=Baseline
2025.08
34.9
OpenVLThinker-7B
2025.08
34.7
Feedback
Search any
task
Search any
task