Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

HD-NDEs: Neural Differential Equations for Hallucination Detection in LLMs

About

In recent years, large language models (LLMs) have made remarkable advancements, yet hallucination, where models produce inaccurate or non-factual statements, remains a significant challenge for real-world deployment. Although current classification-based methods, such as SAPLMA, are highly efficient in mitigating hallucinations, they struggle when non-factual information arises in the early or mid-sequence of outputs, reducing their reliability. To address these issues, we propose Hallucination Detection-Neural Differential Equations (HD-NDEs), a novel method that systematically assesses the truthfulness of statements by capturing the full dynamics of LLMs within their latent space. Our approaches apply neural differential equations (Neural DEs) to model the dynamic system in the latent space of LLMs. Then, the sequence in the latent space is mapped to the classification space for truth assessment. The extensive experiments across five datasets and six widely used LLMs demonstrate the effectiveness of HD-NDEs, especially, achieving over 14% improvement in AUC-ROC on the True-False dataset compared to state-of-the-art techniques.

Qing Li, Jiahui Geng, Zongxiong Chen, Derui Zhu, Yuxia Wang, Congbo Ma, Chenyang Lyu, Fakhri Karray• 2025

Related benchmarks

TaskDatasetResultRank
Hallucination DetectionTriviaQA (test)
AUC-ROC86.3
169
Hallucination DetectionHaluEval (test)
AUC-ROC97.1
126
Hallucination DetectionTruthfulQA (test)
AUC-ROC89.5
91
Hallucination DetectionNQ (test)
AUC ROC95.2
84
Hallucination DetectionCompany
AUC-ROC0.798
68
Hallucination DetectionFact*
AUC-ROC78.6
4
Hallucination DetectionCity*
AUC ROC0.898
2
Hallucination DetectionInvention
AUC ROC0.883
2
Showing 8 of 8 rows

Other info

Follow for update