Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Hallucination Detection on MHaluBench Image-to-Text Segment-level

90.44Hallucinatory Precision

Gemini-based Self-Check

78.927281.916184.90587.8939Jun 16, 2025
Updated 1mo ago

Evaluation Results

MethodLinks
2025.06
90.4471.0879.657.3583.868.175.1173.8977.4473.85
2025.06
89.5393.4791.4684.3876.3380.1588.0686.9584.985.8
2025.06
89.347.7162.1943.7687.6858.3860.3866.5367.6960.29
2025.06
88.7778.7683.4663.1778.5270.0278.6875.9778.6476.74
2025.06
87.0391.0188.9878.5270.7774.4484.682.7780.8981.71
2025.06
8279.9880.9876.0478.3577.1879.2579.0279.1679.08
2025.06
79.3774.1776.6870.5276.2273.2675.0974.9475.1974.97