KG-Rank: Enhancing Large Language Models for Medical QA with Knowledge Graphs and Ranking Techniques

About

Large language models (LLMs) have demonstrated impressive generative capabilities with the potential to innovate in medicine. However, the application of LLMs in real clinical settings remains challenging due to the lack of factual consistency in the generated content. In this work, we develop an augmented LLM framework, KG-Rank, which leverages a medical knowledge graph (KG) along with ranking and re-ranking techniques, to improve the factuality of long-form question answering (QA) in the medical domain. Specifically, when receiving a question, KG-Rank automatically identifies medical entities within the question and retrieves the related triples from the medical KG to gather factual information. Subsequently, KG-Rank innovatively applies multiple ranking techniques to refine the ordering of these triples, providing more relevant and precise information for LLM inference. To the best of our knowledge, KG-Rank is the first application of KG combined with ranking models in medical QA specifically for generating long answers. Evaluation on four selected medical QA datasets demonstrates that KG-Rank achieves an improvement of over 18% in ROUGE-L score. Additionally, we extend KG-Rank to open domains, including law, business, music, and history, where it realizes a 14% improvement in ROUGE-L score, indicating the effectiveness and great potential of KG-Rank.

Rui Yang, Haoran Liu, Edison Marrese-Taylor, Qingcheng Zeng, Yu He Ke, Wanxin Li, Lechao Cheng, Qingyu Chen, James Caverlee, Yutaka Matsuo, Irene Li• 2024

Related benchmarks

Task	Dataset	Result
Multi-choice medical QA	Multi-choice medical QA benchmarks (test)	MMLU-Med Accuracy45.2	28
Medical Reasoning	MedDDx (test)	Basic Accuracy25.3	28
Medical Reasoning	NEEMRs	Recall40	22
Medical Reasoning	XMEMRs	Recall35.39	22
Multi-choice Medical Question Answering	Medical QA Multi-choice	MMLU-Med Accuracy45.2	22
Medical Reasoning	MedDDx	Basic Accuracy25.3	22

Showing 6 of 6 rows

Other info

Follow for update

@wizwand_team Discord