Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Mindscape-Aware Retrieval Augmented Generation for Improved Long Context Understanding

About

Humans understand long and complex texts by relying on a holistic semantic representation of the content. This global view helps organize prior knowledge, interpret new information, and integrate evidence dispersed across a document, as revealed by the Mindscape-Aware Capability of humans in psychology. Current Retrieval-Augmented Generation (RAG) systems lack such guidance and therefore struggle with long-context tasks. In this paper, we propose Mindscape-Aware RAG (MiA-RAG), the first framework to formulate mindscape-aware retrieval and generation as a unified conditioning paradigm for LLM-based RAG. MiA-RAG builds a mindscape through hierarchical summarization and conditions both retrieval and generation on this global semantic representation. This enables the retriever to form enriched query embeddings and the generator to reason over retrieved evidence within a coherent global context. We evaluate MiA-RAG across diverse long-context and bilingual benchmarks for evidence-based understanding and global sense-making. It consistently surpasses baselines, and further analysis shows that it aligns local details with a coherent global representation, enabling more human-like long-context retrieval and reasoning.

Yuqing Li, Jiangnan Li, Zheng Lin, Ziyan Zhou, Junjie Wu, Weiping Wang, Jie Zhou, Mo Yu• 2025

Related benchmarks

TaskDatasetResultRank
Long-context Question AnsweringNarrativeQA
F1 Score53.56
38
Long-context Question AnsweringDetectiveQA-En
Accuracy75.5
38
Long narrative understanding QANoCha
Pair Accuracy55.56
38
Long-context Question AnsweringDetectiveQA-ZH
Accuracy0.8417
38
Long-context Reasoning∞Bench
Accuracy90.39
32
Question AnsweringNarrativeQA Helmet benchmark
F1 Score49.5
9
RetrievalDetectiveQA-ZH
R@346.8
6
Showing 7 of 7 rows

Other info

Follow for update