Share your thoughts, 1 month free Claude Pro on usSee more

Multimodal Visual Question Answering on MMBench

88.6Score

Qwen2.5-VL

Updated 1mo ago

Evaluation Results

Method	Links
Qwen2.5-VL 2026.04		88.6
GIFT 2025.10		87.2
Greedy 2025.10		86.9
Greedy 2025.10		84.6
GIFT 2025.10		84.6
Qwen2.5-VL 2026.04		83.5
Qwen2.5-VL 2026.04		82.47
ForeSight 2026.04		81.5
GIFT 2025.10		75.8
Greedy 2025.10		75.6
Greedy 2025.10		73.1
GIFT 2025.10		73.1
Qwen2-VL-2B-Instruct (DPO, HighAvg.) 2025.05		72.7
Qwen2-VL-2B-Instruct (DPO, LowAvg.) 2025.05		72.3
Qwen2-VL-2B-Instruct (Zeroshot) 2025.05		72
Qwen2-VL-2B-Instruct (DPO, HighVar.) 2025.05		72
Qwen2-VL-2B-Instruct (DPO, Random) 2025.05		71.9
Qwen2-VL-2B-Instruct (DPO, Full) 2025.05		71.6