Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Walking Down the Memory Maze: Beyond Context Limit through Interactive Reading

About

Large language models (LLMs) have advanced in large strides due to the effectiveness of the self-attention mechanism that processes and compares all tokens at once. However, this mechanism comes with a fundamental issue -- the predetermined context window is bound to be limited. Despite attempts to extend the context window through methods like extrapolating the positional embedding, using recurrence, or selectively retrieving essential parts of the long sequence, long-text understanding continues to be a challenge. We propose an alternative approach which instead treats the LLM as an interactive agent, allowing it to decide how to read the text via iterative prompting. We introduce MemWalker, a method that first processes the long context into a tree of summary nodes. Upon receiving a query, the model navigates this tree in search of relevant information, and responds once it gathers sufficient information. On long-text question answering tasks our method outperforms baseline approaches that use long context windows, recurrence, and retrieval. We show that, beyond effective reading, MemWalker enhances explainability by highlighting the reasoning steps as it interactively reads the text; pinpointing the relevant text segments related to the query.

Howard Chen, Ramakanth Pasunuru, Jason Weston, Asli Celikyilmaz• 2023

Related benchmarks

TaskDatasetResultRank
Question AnsweringNarrativeQA (test)
ROUGE-L9.8
68
Question AnsweringMuSiQue (test)
F1 Score24
43
Dialogue Response GenerationChronicle
B-430.3
38
Dialogue Response GenerationMSC
B-4 Score33.4
38
Response GenerationChronicle and MSC Average
CEA49.7
30
Question AnsweringQASPER (test)
F1 Score (Match)39
27
Fact RetrievalGov Report
F1 Score23.4
13
Fact RetrievalWCEP
F1 Score21
13
Fact RetrievalAcademicEval Related-multi
F1 Score19.3
13
Fact RetrievalAcademicEval Abstract-multi
F1 Score15.5
13
Showing 10 of 17 rows

Other info

Follow for update