Share your thoughts, 1 month free Claude Pro on usSee more

Hallucination Evaluation on HallB

54.2Score

SAIL

Updated 1mo ago

Evaluation Results

Method	Links
SAIL 2025.10		54.2
Qwen2.5-VL 2025.10		52.9
Encoder-Based 2025.10		51.4
Qwen2-VL 2025.10		50.6
InternVL2.5 2025.10		50.1
InternVL3 2025.10		49.9
NEO 2025.10		46.4
Qwen2.5-VL 2025.10		46.3
Encoder-Based 2025.10		44.4
NEO 2025.10		43.1
InternVL2.5 2025.10		42.6
InternVL3 2025.10		42.5
Qwen2-VL 2025.10		41.7
HoVLE 2025.10		38.4
BREEN 2025.10		37
Mono-InternVL 2025.10		34.8
Mono-InternVL-1.5 2025.10		32.5
EVE 2025.10		26.4
Chameleon 2025.10		17.1