Share your thoughts, 1 month free Claude Pro on usSee more

Multimodal Math Reasoning on V-Math

53Accuracy

AutoTool (Qwen3-8B)

Updated 5mo ago

Evaluation Results

Method	Links
AutoTool (Qwen3-8B) 2025.12		53
AutoTool (Qwen2.5-VL-7B) 2025.12		44.3
GPT4o 2025.12		41.4
Qwen2.5-VL-72B-Instruct 2025.12		24.5
v-ToolRL 2025.12		19.5