Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Knowledge Capsules: Structured Nonparametric Memory Units for LLMs

About

Large language models (LLMs) encode knowledge in parametric weights, making it costly to update or extend without retraining. Retrieval-augmented generation (RAG) mitigates this limitation by appending retrieved text to the input, but operates purely through context expansion, where external knowledge competes as tokens within the attention mechanism. As a result, its influence is indirect and often unstable, particularly in long context and multi hop reasoning scenarios. We propose Knowledge Capsules, structured nonparametric memory units that represent normalized relational knowledge and can be constructed directly from document corpora using a frozen base model. Instead of injecting knowledge as text, we introduce an External Key Value Injection (KVI) framework that compiles capsules into attention-compatible key value representations, enabling external knowledge to directly participate in the model's attention computation. By shifting knowledge integration from context-level augmentation to memory level interaction, the proposed framework consistently outperforms RAG and GraphRAG across multiple QA benchmarks, with improved stability and accuracy in long context and multi hop reasoning, while requiring no parameter updates.

Bin Ju, Shenfeng Weng, Danying Zhou, Rongkai Xu, Kunkai Su• 2026

Related benchmarks

TaskDatasetResultRank
Question AnsweringHotpotQA
EM40.8
173
Question AnsweringNQ
EM67
45
Question AnsweringMedHopQA n40
Exact Match92.5
10
Question AnsweringMedHopQA official
Exact Match (EM)75.4
10
Showing 4 of 4 rows

Other info

Follow for update