Share your thoughts, 1 month free Claude Pro on usSee more

Multimodal Mathematical Reasoning on MathVerse (testmini)

57Accuracy

OpenAI-o1

Updated 4mo ago

Evaluation Results

Method	Links
OpenAI-o1 2025.06		57
Qwen2.5-VL-72B-IT 2025.06		55.8
Perception-R1-7B 2025.06		54.3
Vision-R1-7B 2025.06		52.4
Claude-3.7-Sonnet 2025.06		52
MM-Eureka-7B 2025.06		51.9
VLAA-Thinker-7B 2025.06		51.2
GPT-4o 2025.06		50.2
SophiaVL-R1-7B 2025.06		49
Qwen2.5-VL-7B-IT 2025.06		47.4
OpenVLThinker-7B 2025.06		47.4
R1-OneVision-7B 2025.06		46.5
URSA-7B 2025.06		45.7
R1-VL-7B 2025.06		40.8
InternVL2.5-8B 2025.06		39.5
Qwen2-VL-7B-IT 2025.06		31.1