Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Hidden in Memory: Sleeper Memory Poisoning in LLM Agents

About

Large language models are increasingly augmented with persistent memory, allowing assistants to store user-specific information across sessions for personalization and continuity. This statefulness introduces a new security risk: adversarial content can corrupt what an assistant remembers and thereby influence future interactions. We propose and study sleeper memory poisoning, a delayed attack in which an adversary manipulates external context, such as a document, webpage, or repository, to cause the assistant to store a fabricated memory about the user. Unlike conventional prompt injection, the attack can remain dormant and re-emerge across multiple later conversations. We evaluate the full attack pipeline: whether poisoned memories are written, later retrieved, and ultimately used to steer the following conversations. Across stateful LLM assistants, poisoned memories were added up to 99.8% on GPT-5.5 and 95% on Kimi-K2.6. Crucially, among successful retrievals, poisoned memories cause attacker-intended agentic actions in 60-89% of evaluations across models. These results show that persistent memory can act as a long-term attack surface across multiple future conversations.

Sidharth Pulipaka, Stanislau Hlebik, Leonidas Raghav, Sahar Abdelnabi, Vyas Raina, Ivaxi Sheth, Mario Fritz• 2026

Related benchmarks

TaskDatasetResultRank
Indirect Prompt InjectionLLM Behavior Subset 1
IR99.8
24
Indirect Prompt InjectionAgent Action Subset 2
IR5.5
24
Agent ActionAgent Action subset--
12
LLM BehaviorLLM Behavior--
12
Prompt Injection Attack SuccessLLM Behavior--
10
Prompt Injection Attack SuccessAgent Action--
10
Indirect Prompt Injection Attack Success EvaluationLLM Behavior Goal-Adjacent--
5
Indirect Prompt Injection Attack Success EvaluationLLM Behavior Goal-Distant--
5
Indirect Prompt Injection Attack Success EvaluationAgent Action Goal-Adjacent 2--
5
Indirect Prompt Injection Attack Success EvaluationAgent Action Goal-Distant 2--
5
Showing 10 of 10 rows

Other info

Follow for update