Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Reasoning in Action: MCTS-Driven Knowledge Retrieval for Large Language Models

About

Large language models (LLMs) typically enhance their performance through either the retrieval of semantically similar information or the improvement of their reasoning capabilities. However, a significant challenge remains in effectively integrating both retrieval and reasoning strategies to optimize LLM performance. In this paper, we introduce a reasoning-aware knowledge retrieval method that enriches LLMs with information aligned to the logical structure of conversations, moving beyond surface-level semantic similarity. We follow a coarse-to-fine approach for knowledge retrieval. First, we identify a contextually relevant sub-region of the knowledge base, ensuring that all sentences within it are relevant to the context topic. Next, we refine our search within this sub-region to extract knowledge that is specifically relevant to the reasoning process. Throughout both phases, we employ the Monte Carlo Tree Search-inspired search method to effectively navigate through knowledge sentences using common keywords. Experiments on two multi-turn dialogue datasets demonstrate that our knowledge retrieval approach not only aligns more closely with the underlying reasoning in human conversations but also significantly enhances the diversity of the retrieved knowledge, resulting in more informative and creative responses.

Shuqi Liu, Bowei He, Chen Ma, Linqi Song• 2025

Related benchmarks

TaskDatasetResultRank
Knowledge retrievalDailyDialog
BERTScore Precision (avg)84.29
16
Knowledge retrievalEmpathetic Dialogue (EMP)
BERTScore Precision (avg)85.27
16
Human Logic AlignmentDailyDialog
Human Logic Alignment (T=0.5)80.97
9
Human Logic AlignmentEmpathetic Dialogue
HLA (T=0.5)85.2
9
Knowledge Pair-wise DiversityDailyDialog (test)
Precision86.37
9
Knowledge Pair-wise DiversityEmpathetic Dialogue (test)
Precision86.53
9
Response GenerationDailyDialog
Pairwise Diversity78.5
6
Response GenerationEmpathetic Dialogue
Pairwise Diversity75
6
Dialogue Response GenerationDailyDialog
Coherence4.08
3
Dialogue Response GenerationEmpathetic Dialogue
Coherence3.98
3
Showing 10 of 10 rows

Other info

Follow for update