Share your thoughts, 1 month free Claude Pro on usSee more

Hallucination Detection on MMEvalPro perception

98.6F1 (Faithful)

Auxiliary Model

Updated 3mo ago

Evaluation Results

Method	Links
Auxiliary Model 2025.12		98.6	97.8
HaloScope 2025.12		91.5	14.9
Prompting 2025.12		84.8	30.8
SAPLMA 2025.12		74.9	25.4
kNN 2025.12		67.5	8.9