Share your thoughts, 1 month free Claude Pro on usSee more

Multimodal Reasoning on MMBench EN

83Accuracy

Instruct

Updated 2mo ago

Evaluation Results

Method	Links
Instruct 2026.03		83
DPO 2026.03		83
DPO 2026.03		83
DPO-Shift 2026.03		83
ACPO 2026.03		83
Instruct 2026.03		82.8
IPO 2026.03		82
IPO 2026.03		82
β-DPO 2026.03		82
SimPO 2026.03		81.5
SimPO 2026.03		81
β-DPO 2026.03		81
DPO-Shift 2026.03		81
ACPO 2026.03		81
F3A 2026.05		77.73
Qwen3-VL-2B 2026.05		77.5
DivPrune 2026.05		77.5
VisionZip 2026.05		77.5
F3A 2026.05		77.5
CDPruner 2026.05		77.08
CDPruner 2026.05		77.02
DivPrune 2026.05		76.83
VisionZip 2026.05		76.46
F3A 2026.05		76.07
FastV 2026.05		76
CDPruner 2026.05		75.93
FastV 2026.05		74.66
VisionZip 2026.05		74.5
DivPrune 2026.05		74.47
FastV 2026.05		72.07
LLaVA-1.5-13B + MMFuser 2024.10		69.9
LLaVA-1.5-13B 2024.10		67.7
LLaVA-1.5-7B + MMFuser 2024.10		67.5
LLaVA-1.5-7B 2024.10		64.3
Qwen-VL-Chat 2024.10		60.6
Shikra 2024.10		58.8
IDEFICS-80B 2024.10		54.5
IDEFICS-9B 2024.10		48.2
Qwen-VL 2024.10		38.2
InstructBLIP 2024.10		36