Share your thoughts, 1 month free Claude Pro on usSee more

Generative Hallucination Mitigation on MMHal-Bench

3.49Overall Score

GPT-4V

Updated 3mo ago

Evaluation Results

Method	Links
GPT-4V 2026.04		3.49	28
PSRD 2026.04		2.92	49
HSA-DPO 2026.04		2.61	48
Octopus 2026.04		2.61	50
LLaVA-DPO 2026.04		2.58	50
SENA 2026.04		2.33	52
AVISC 2026.04		2.19	59
OPERA 2026.04		2.15	54
M3ID 2026.04		2.14	61
STIC 2026.04		2.07	56
EOS 2026.04		2.03	59
VCD 2026.04		1.96	64
LLaVA-1.5-7B (Base) 2026.04		1.55	76