Share your thoughts, 1 month free Claude Pro on usSee more

Hallucination Detection on ChatProtect SC

87F1 Score

HalluClean

Updated 4mo ago

Evaluation Results

Method	Links
HalluClean 2025.11		87	87
GPT-4o-mini 2025.11		84.2	72.7
ChatProtect 2025.11		83.8	84.7
HalluClean 2025.11		80.8	83.3
HalluClean 2025.11		76.1	80.3
Step-by-Step 2025.11		68.1	75
Plan-and-Solve 2025.11		66.4	73
Llama-3-70B 2025.11		65.8	73.6
DeepSeek-V3 2025.11		52	67.3
GPT-3.5-turbo 2025.11		46	64
DeepSeek-R1 2025.11		40	62
SelfCheckGPT 2025.11		5.7	12