Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Doc-to-LoRA: Learning to Instantly Internalize Contexts

About

Long input sequences are central to in-context learning, document understanding, and multi-step reasoning of Large Language Models (LLMs). However, the quadratic attention cost of Transformers makes inference memory-intensive and slow. While context distillation (CD) can transfer information into model parameters, per-prompt distillation is impractical due to training costs and latency. To address these limitations, we propose Doc-to-LoRA (D2L), a lightweight hypernetwork that meta-learns to perform approximate CD within a single forward pass. Given an unseen prompt, D2L generates a LoRA adapter for a target LLM, enabling subsequent queries to be answered without re-consuming the original context, reducing latency and KV-cache memory consumption during inference of the target LLM. On a long-context needle-in-a-haystack task, D2L successfully learns to map contexts into adapters that store the needle information, achieving near-perfect zero-shot accuracy at sequence lengths exceeding the target LLM's native context window by more than 4x. On real-world QA datasets with limited compute, D2L outperforms standard CD while significantly reducing peak memory consumption and update latency. We envision that D2L can facilitate rapid adaptation of LLMs, opening up the possibility of frequent knowledge updates and personalized chat behavior.

Rujikorn Charakorn, Edoardo Cetin, Shinnosuke Uesaka, Robert Tjarko Lange• 2026

Related benchmarks

TaskDatasetResultRank
Multihop Question Answering2WikiMultihopQA
Performance0.857
5
Question AnsweringMultifieldQA
Rel. Perf vs Truncated ICL0.485
5
Knowledge Conflict ResolutionKID-Bench v2
Performance (Difficulty A)96.7
4
Knowledge Conflict ResolutionRippleEdits style 40 q
Accuracy42.5
4
Knowledge CombinationKID-Bench Category B v2
Accuracy68
3
Novel Knowledge RecallKID-Bench Category A v2
Accuracy96.7
3
Question AnsweringSQuAD
ROUGE-L Recall83.5
3
Knowledge Conflict ResolutionHeld-out 30 q
Accuracy50
3
Knowledge Conflict ResolutionCounterFact-style 40 q
Accuracy42.5
3
Knowledge Conflict ResolutionCounterFact 500
Accuracy92.8
3
Showing 10 of 11 rows

Other info

Follow for update