Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

DALK: Dynamic Co-Augmentation of LLMs and KG to answer Alzheimer's Disease Questions with Scientific Literature

About

Recent advancements in large language models (LLMs) have achieved promising performances across various applications. Nonetheless, the ongoing challenge of integrating long-tail knowledge continues to impede the seamless adoption of LLMs in specialized domains. In this work, we introduce DALK, a.k.a. Dynamic Co-Augmentation of LLMs and KG, to address this limitation and demonstrate its ability on studying Alzheimer's Disease (AD), a specialized sub-field in biomedicine and a global health priority. With a synergized framework of LLM and KG mutually enhancing each other, we first leverage LLM to construct an evolving AD-specific knowledge graph (KG) sourced from AD-related scientific literature, and then we utilize a coarse-to-fine sampling method with a novel self-aware knowledge retrieval approach to select appropriate knowledge from the KG to augment LLM inference capabilities. The experimental results, conducted on our constructed AD question answering (ADQA) benchmark, underscore the efficacy of DALK. Additionally, we perform a series of detailed analyses that can offer valuable insights and guidelines for the emerging topic of mutually enhancing KG and LLM. We will release the code and data at https://github.com/David-Li0406/DALK.

Dawei Li, Shu Yang, Zhen Tan, Jae Young Baik, Sukwon Yun, Joseph Lee, Aaron Chacko, Bojian Hou, Duy Duong-Tran, Ying Ding, Huan Liu, Li Shen, Tianlong Chen• 2024

Related benchmarks

TaskDatasetResultRank
Knowledge Base Question AnsweringWebQSP
Hits@158.9
53
Knowledge Base Question AnsweringCWQ
Hits@145.8
30
Multi-hop Question AnsweringMultihopQA
Accuracy53.952
24
Attributed Question AnsweringALCE
STRREC Score21.408
24
Multiple-choice Question AnsweringQUALITY
Accuracy34.251
19
Question AnsweringMusiqueQA
Accuracy11.367
16
Question AnsweringPopQA
Accuracy45.604
13
Knowledge Base Question AnsweringGrailQA
Hits@156.7
11
Question AnsweringHotpotQA
Accuracy33.252
10
Graph ReasoningG-bench CS
Inference Time (s)26.8
9
Showing 10 of 11 rows

Other info

Follow for update