Share your thoughts, 1 month free Claude Pro on usSee more

Multimodal Medical Question Answering on PMC v1.0 (test)

0.5855Accuracy

GPT-4o

Updated 4mo ago

Evaluation Results

Method	Links
GPT-4o 2025.08		0.5855
MedVLThinker-32B RL 2025.08		0.5437
HuatuoGPT-Vision-7B 2025.08		0.5339
Qwen2.5-VL-32B-Instruct 2025.08		0.5328
HuatuoGPT-Vision-34B 2025.08		0.5254
Gemme 3 27B 2025.08		0.5205
GPT-4o-mini 2025.08		0.519
MedVLThinker-7B RL 2025.08		0.5067
Qwen2.5-VL-7B-Instruct 2025.08		0.493
MedVLThinker-3B RL 2025.08		0.4732
Qwen2.5-VL-3B-Instruct 2025.08		0.4477
Gemme 3 4B 2025.08		0.4442
MedGemma 4B 2025.08		0.4273
MedGemma 27B 2025.08		0.3675
Llava Med v1.5 Mistral 7B 2025.08		0.3428