Share your thoughts, 1 month free Claude Pro on usSee more

Multimodal Mathematical Reasoning on MathVista (Pass@1 accuracy)

73.8Pass@1 Accuracy

Qwen2.5-VL-7B-Instruct + RFT

Updated 3mo ago

Evaluation Results

Method	Links
Qwen2.5-VL-7B-Instruct + RFT 2026.04		73.8
InternVL3-9B 2026.04		71.5
Qwen2.5-VL-7B-Instruct + cold start 2026.04		71.1
Gemini-1.5-Pro 2026.04		68.7
InternVL2.5-26B 2026.04		68.2
Qwen2.5-VL-7B-Instruct 2026.04		68.2
Qwen2.5-VL-7B-Instruct 2026.04		68.1
Qwen2.5-VL-7B-Instruct 2026.04		68.1
LLaVA-OneVision-72B 2026.04		67.1
Claude-3.5-Sonnet 2026.04		64.7
InternVL2.5-8B 2026.04		64.5
Qwen2-VL-7B 2026.04		62.3
MiniCPM-V2.6 2026.04		60.8
GPT-4o-20240513 2026.04		60
Cambrian-34B 2026.04		53.2