Hidden in Memory: Sleeper Memory Poisoning in LLM Agents

About

Large language models are increasingly augmented with persistent memory, allowing assistants to store user-specific information across sessions for personalization and continuity. This statefulness introduces a new security risk: adversarial content can corrupt what an assistant remembers and thereby influence future interactions. We propose and study sleeper memory poisoning, a delayed attack in which an adversary manipulates external context, such as a document, webpage, or repository, to cause the assistant to store a fabricated memory about the user. Unlike conventional prompt injection, the attack can remain dormant and re-emerge across multiple later conversations. We evaluate the full attack pipeline: whether poisoned memories are written, later retrieved, and ultimately used to steer the following conversations. Across stateful LLM assistants, poisoned memories were added up to 99.8% on GPT-5.5 and 95% on Kimi-K2.6. Crucially, among successful retrievals, poisoned memories cause attacker-intended agentic actions in 60-89% of evaluations across models. These results show that persistent memory can act as a long-term attack surface across multiple future conversations.

Sidharth Pulipaka, Stanislau Hlebik, Leonidas Raghav, Sahar Abdelnabi, Vyas Raina, Ivaxi Sheth, Mario Fritz• 2026

Related benchmarks

Task	Dataset	Result
Indirect Prompt Injection	LLM Behavior Subset 1	IR99.8	24
Indirect Prompt Injection	Agent Action Subset 2	IR5.5	24
Agent Action	Agent Action subset	--	12
LLM Behavior	LLM Behavior	--	12
Prompt Injection Attack Success	LLM Behavior	--	10
Prompt Injection Attack Success	Agent Action	--	10
Indirect Prompt Injection Attack Success Evaluation	LLM Behavior Goal-Adjacent	--	5
Indirect Prompt Injection Attack Success Evaluation	LLM Behavior Goal-Distant	--	5
Indirect Prompt Injection Attack Success Evaluation	Agent Action Goal-Adjacent 2	--	5
Indirect Prompt Injection Attack Success Evaluation	Agent Action Goal-Distant 2	--	5

Showing 10 of 10 rows

Other info

Follow for update

@wizwand_team Discord