Hallucination Detection on MMEvalPro perception

98.6 F1 (Faithful)

Auxiliary Model

[Chart: results over time; Dec 13, 2025]
Updated 1mo ago

Evaluation Results

Method    Links
2025.12    98.6    97.8
2025.12    91.5    14.9
2025.12    84.8    30.8
2025.12    74.9    25.4
2025.12    67.5    8.9