Reasoning in Action: MCTS-Driven Knowledge Retrieval for Large Language Models

About

Large language models (LLMs) typically enhance their performance through either the retrieval of semantically similar information or the improvement of their reasoning capabilities. However, a significant challenge remains in effectively integrating both retrieval and reasoning strategies to optimize LLM performance. In this paper, we introduce a reasoning-aware knowledge retrieval method that enriches LLMs with information aligned to the logical structure of conversations, moving beyond surface-level semantic similarity. We follow a coarse-to-fine approach for knowledge retrieval. First, we identify a contextually relevant sub-region of the knowledge base, ensuring that all sentences within it are relevant to the context topic. Next, we refine our search within this sub-region to extract knowledge that is specifically relevant to the reasoning process. Throughout both phases, we employ a Monte Carlo Tree Search (MCTS)-inspired search method to navigate through knowledge sentences using common keywords. Experiments on two multi-turn dialogue datasets demonstrate that our knowledge retrieval approach not only aligns more closely with the underlying reasoning in human conversations but also significantly enhances the diversity of the retrieved knowledge, resulting in more informative and creative responses.

Shuqi Liu, Bowei He, Chen Ma, Linqi Song • 2025
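
The coarse-to-fine retrieval described in the abstract can be illustrated with a toy sketch. This is not the authors' implementation: the stopword-based keyword extraction, the keyword-overlap reward, the UCB1 selection rule, the depth limit, the `region_size` parameter, the choice of seed sentence, and the function names `keywords`, `neighbours`, `mcts`, and `retrieve` are all assumptions made for illustration. The sketch links sentences that share a keyword, runs one search rewarded by overlap with the whole context to pick a sub-region, then runs a second search inside that sub-region rewarded by overlap with the latest turn.

```python
import math
import random

# Hypothetical stopword list; a real system would use a proper keyword extractor.
STOPWORDS = {"a", "an", "the", "is", "am", "was", "it", "i", "and", "this", "to", "of"}


def keywords(text):
    """Crude keyword extraction (an assumption): lowercase tokens minus stopwords."""
    tokens = {w.strip(".,!?").lower() for w in text.split()}
    return tokens - STOPWORDS - {""}


def neighbours(i, kws, allowed):
    """Ids in `allowed` whose sentence shares at least one keyword with sentence i."""
    return [j for j in sorted(allowed) if j != i and kws[i] & kws[j]]


def mcts(start, kws, allowed, reward, n_iter=200, depth=3, c=1.4, seed=0):
    """MCTS-style search over the keyword graph; returns visit counts per tree path."""
    rng = random.Random(seed)
    visits, value = {}, {}  # statistics keyed by tree path (tuple of sentence ids)
    for _ in range(n_iter):
        path = (start,)
        # Selection and expansion: descend by UCB1 until an unvisited child appears.
        while len(path) <= depth:
            children = [path + (j,) for j in neighbours(path[-1], kws, allowed)
                        if j not in path]
            if not children:
                break
            fresh = [ch for ch in children if ch not in visits]
            if fresh:
                path = rng.choice(fresh)
                break
            log_n = math.log(visits[path])
            path = max(children, key=lambda ch: value[ch] / visits[ch]
                       + c * math.sqrt(log_n / visits[ch]))
        # Simulation: random walk along shared keywords up to the depth limit.
        rollout = list(path)
        while len(rollout) <= depth:
            options = [j for j in neighbours(rollout[-1], kws, allowed)
                       if j not in rollout]
            if not options:
                break
            rollout.append(rng.choice(options))
        tail = rollout[1:]
        score = sum(reward(j) for j in tail) / len(tail) if tail else 0.0
        # Backpropagation: credit every prefix of the selected tree path.
        for k in range(1, len(path) + 1):
            visits[path[:k]] = visits.get(path[:k], 0) + 1
            value[path[:k]] = value.get(path[:k], 0.0) + score
    return visits


def retrieve(context, last_turn, sentences, region_size=4):
    """Coarse-to-fine retrieval: topical sub-region first, then a reasoning chain."""
    kws = [keywords(s) for s in sentences]
    topic, focus = keywords(context), keywords(last_turn)

    def overlap(j, target):
        return len(kws[j] & target) / max(len(target), 1)

    # Assumption: seed the search at the sentence closest to the context topic.
    start = max(range(len(sentences)), key=lambda j: overlap(j, topic))
    # Coarse phase: keep the most-visited sentences as the topical sub-region.
    counts = {}
    coarse = mcts(start, kws, set(range(len(sentences))), lambda j: overlap(j, topic))
    for path, n in coarse.items():
        counts[path[-1]] = counts.get(path[-1], 0) + n
    region = set(sorted(counts, key=counts.get, reverse=True)[:region_size]) | {start}
    # Fine phase: search only inside the sub-region, rewarding the latest turn.
    visits = mcts(start, kws, region, lambda j: overlap(j, focus))
    path = (start,)
    while True:  # follow the most-visited child from the root
        children = [p for p in visits if len(p) == len(path) + 1 and p[:-1] == path]
        if not children:
            return [sentences[j] for j in path]
        path = max(children, key=visits.get)


kb = [
    "Rain makes the road slippery.",
    "A slippery road causes accidents.",
    "Accidents delay traffic.",
    "Heavy traffic leaves commuters late.",
    "Rain helps crops grow.",
    "Bananas contain potassium.",
]
context = "The rain made the road slippery this morning."
last_turn = "Sorry I am late, traffic was terrible."
chain = retrieve(context, last_turn, kb)
print(chain)
```

The returned list is a chain of knowledge sentences that starts at the seed sentence and follows shared keywords. A sentence that shares no keyword with any other, such as the one about bananas, is unreachable and cannot be retrieved. Fixing the random seed keeps repeated calls reproducible.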

Related benchmarks

Task	Dataset	Metric	Value
Knowledge retrieval	DailyDialog	BERTScore Precision (avg)	84.29	16
Knowledge retrieval	Empathetic Dialogue (EMP)	BERTScore Precision (avg)	85.27	16
Human Logic Alignment	DailyDialog	Human Logic Alignment (T=0.5)	80.97	9
Human Logic Alignment	Empathetic Dialogue	Human Logic Alignment (T=0.5)	85.2	9
Knowledge Pair-wise Diversity	DailyDialog (test)	Precision	86.37	9
Knowledge Pair-wise Diversity	Empathetic Dialogue (test)	Precision	86.53	9
Response Generation	DailyDialog	Pairwise Diversity	78.5	6
Response Generation	Empathetic Dialogue	Pairwise Diversity	75	6
Dialogue Response Generation	DailyDialog	Coherence	4.08	3
Dialogue Response Generation	Empathetic Dialogue	Coherence	3.98	3

