Share your thoughts, 1 month free Claude Pro on usSee more

Multimodal Mathematical Reasoning on MathVerse (Pass@1 Accuracy)

47.2Pass@1 Accuracy

Qwen2.5-VL-7B-Instruct + RFT

Updated 3mo ago

Evaluation Results

Method	Links
Qwen2.5-VL-7B-Instruct + RFT 2026.04		47.2
Claude-3.5-Sonnet 2026.04		44.2
Qwen2.5-VL-7B-Instruct + cold start 2026.04		44.1
Qwen2.5-VL-7B-Instruct 2026.04		43.9
Qwen2.5-VL-7B-Instruct 2026.04		43.3
Qwen2.5-VL-7B-Instruct 2026.04		43.3
GPT-4o-20240513 2026.04		40.6
InternVL3-9B 2026.04		35.3
Gemini-1.5-Pro 2026.04		30.1
Qwen2-VL-7B 2026.04		30.1
LLaVA-OneVision-72B 2026.04		27.2
InternVL2.5-26B 2026.04		24
InternVL2.5-8B 2026.04		22.8
MiniCPM-V2.6 2026.04		18.9