Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Hallucination Evaluation on MMHal

4Score

GPT-4o-20240513

2.182.65253.1253.5975Dec 6, 2024
Updated 2d ago

Evaluation Results

MethodLinks
2024.12
4--
2024.12
3.89--
2024.12
3.83--
2024.12
3.75--
2024.12
3.71--
2024.12
3.7--
2024.12
3.65--
2024.12
3.6--
2024.12
3.55--
2024.12
3.4--
2024.12
3.33--
2024.12
3.31--
2024.12
3.11--
2024.12
2.94--
2024.12
2.75--
2024.12
2.52--
2024.12
2.49--
2024.12
2.25--
2025.04
-31.970.8
2025.04
-4052.1
2025.04
-42.147.9