Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Multimodal Hallucination Evaluation on MMHal-Bench

4.84Average Score

GIFT

2.0842.79953.5154.2305Oct 15, 2025Nov 21, 2025Dec 29, 2025Feb 4, 2026Mar 14, 2026Apr 20, 2026May 28, 2026
Updated 2d ago

Evaluation Results

MethodLinks
2025.10
4.84--26.4
2025.10
4.8--28.3
2025.10
4.6729--
2025.10
4.6329--
2025.10
4.5631--
2025.10
4.5232--
2026.05
4.0140.62--
2026.05
3.7543.75--
2026.05
3.72---
2026.05
3.64---
2025.10
3.58--27.5
2025.10
3.53--32.7
2026.05
3.49---
2026.05
3.49---
2026.05
3.48---
2026.05
3.47---
2026.05
3.3555.21--
2026.05
3.3452.08--
2026.05
3.3354.17--
2026.05
3.2955.21--
2026.05
3.2955.21--
2026.05
3.2754.17--
2026.05
3.2755.21--
2026.05
3.2440.229.9-
2026.05
3.2454.17--
2026.05
3.126148.9-
2026.05
3.157.29--
2026.05
3.0948.440.8-
2026.05
3.0543.634.7-
2026.05
3.0541--
2026.05
3.0160.42--
2026.05
338--
2026.05
2.9961.46--
2026.05
2.9861.46--
2026.05
2.9759.38--
2026.05
2.9537--
2026.05
2.9149.744.4-
2026.05
2.91---
2026.05
2.9163.54--
2026.05
2.9163.54--
2026.05
2.8755.449.8-
2026.05
2.8662.5--
2026.05
2.8361.46--
2026.05
2.8248--
2026.05
2.8149--
2026.05
2.7966.67--
2026.05
2.7645--
2026.04
2.7445--
2026.05
2.7246--
2026.05
2.72---
2025.10
2.72--55.8
2026.05
2.7156.752.3-
2026.05
2.6866.67--
2026.05
2.6665.62--
2026.05
2.6249--
2026.05
2.62---
2025.10
2.61--56.2
2026.04
2.5845--
2026.05
2.5845--
2026.05
2.5765.455.8-
2026.05
2.5768.75--
2026.05
2.57---
2026.05
2.57---
2026.05
2.5668.75--
2025.10
2.5559--
2026.05
2.55---
2025.10
2.5360--
2026.05
2.5352--
2025.10
2.5261--
2025.10
2.52--56.2
2026.04
2.549--
2026.04
2.4849--
2026.05
2.4852--
2025.10
2.48--57.3
2026.05
2.4751--
2025.10
2.46--59.2
2026.04
2.451--
2025.10
2.4--60.8
2026.04
2.3954--
2026.05
2.3954--
2025.10
2.3764--
2025.10
2.37--60.5
2026.04
2.3658--
2025.10
2.3565--
2026.04
2.3553--
2026.05
2.35---
2025.10
2.3265--
2025.10
2.3264--
2025.10
2.31--61.5
2026.04
2.354--
2026.05
2.3---
2025.10
2.2765--
2026.05
2.2764--
2026.04
2.2554--
2026.05
2.2554--
2026.04
2.2257--
2025.10
2.22--65.2
2025.10
2.2150--
2025.10
2.2150--
2026.05
2.19---
Showing 100 of 129 rows