Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

KBQA-o1: Agentic Knowledge Base Question Answering with Monte Carlo Tree Search

About

Knowledge Base Question Answering (KBQA) aims to answer natural language questions with a large-scale structured knowledge base (KB). Despite advancements with large language models (LLMs), KBQA still faces challenges in weak KB awareness, imbalance between effectiveness and efficiency, and high reliance on annotated data. To address these challenges, we propose KBQA-o1, a novel agentic KBQA method with Monte Carlo Tree Search (MCTS). It introduces a ReAct-based agent process for stepwise logical form generation with KB environment exploration. Moreover, it employs MCTS, a heuristic search method driven by policy and reward models, to balance agentic exploration's performance and search space. With heuristic exploration, KBQA-o1 generates high-quality annotations for further improvement by incremental fine-tuning. Experimental results show that KBQA-o1 outperforms previous low-resource KBQA methods with limited annotated data, boosting Llama-3.1-8B model's GrailQA F1 performance to 78.5% compared to 48.5% of the previous sota method with GPT-3.5-turbo. Our code is publicly available.

Haoran Luo, Haihong E, Yikai Guo, Qika Lin, Xiaobao Wu, Xinyu Mu, Wenhao Liu, Meina Song, Yifan Zhu, Luu Anh Tuan• 2025

Related benchmarks

TaskDatasetResultRank
Knowledge Base Question AnsweringWEBQSP (test)--
145
Knowledge Graph Question AnsweringWebQSP--
143
Knowledge Base Question AnsweringCWQ (ComplexWebQuestions)
Hits@1 Accuracy72
32
Knowledge Base Question AnsweringGrailQA
Hits@172.9
31
Knowledge Base Question AnsweringWebQuestion
Hits@182.5
24
Knowledge Base Question AnsweringGraphQ (test)
F148.7
19
Multi-hop ReasoningWebQSP
Hits@188.3
10
Multi-hop ReasoningCWQ
Hits@172
10
Multi-hop reasoning Knowledge Base Question AnsweringWebQuestion
Hit@182.5
9
Generalization Knowledge Base Question AnsweringGrailQA
Hit@172.9
9
Showing 10 of 11 rows

Other info

Follow for update