Share your thoughts, 1 month free Claude Pro on usSee more

Multi-hop Faithfulness Hallucination Detection on HoVer Refined

82.9Macro F1

FaithLens

Updated 5mo ago

Evaluation Results

Method	Links
FaithLens 2025.12		82.9
GPT-4.1 2025.12		82.6
Llama-3.1-405B-Inst 2025.12		81.6
o3 2025.12		81.1
ClearCheck 2025.12		80.3
Claude-3.7-Sonnet 2025.12		80.2
DeepSeek-V3.2 2025.12		80
o1 2025.12		79.9
o3-mini 2025.12		78.5
DeepSeek-V3.2 2025.12		76.7
MiniCheck 2025.12		74.9
GPT-4o 2025.12		73.6
AlignScore 2025.12		73.3
FactCG 2025.12		73.1