Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Taming Various Privilege Escalation in LLM-Based Agent Systems: A Mandatory Access Control Framework

About

Large Language Model (LLM)-based agent systems are increasingly deployed for complex real-world tasks but remain vulnerable to natural language-based attacks that exploit over-privileged tool use. This paper aims to understand and mitigate such attacks through the lens of privilege escalation, defined as agent actions exceeding the least privilege required for a user's intended task. Based on a formal model of LLM agent systems, we identify novel privilege escalation scenarios, particularly in multi-agent systems, including a variant akin to the classic confused deputy problem. To defend against both known and newly demonstrated privilege escalation, we propose SEAgent, a mandatory access control (MAC) framework built upon attribute-based access control (ABAC). SEAgent monitors agent-tool interactions via an information flow graph and enforces customizable security policies based on entity attributes. Our evaluations show that SEAgent effectively blocks various privilege escalation while maintaining a low false positive rate and negligible system overhead. This demonstrates its robustness and adaptability in securing LLM-based agent systems.

Zimo Ji, Daoyuan Wu, Wenyuan Jiang, Pingchuan Ma, Zongjie Li, Yudong Gao, Shuai Wang, Yingjiu Li• 2026

Related benchmarks

TaskDatasetResultRank
RAG Poisoning DefenseExtended InjecAgent RAG Poisoning
ASR0.00e+0
12
Single-agent tool useAPI-Bank reconstructed
Correctness74.73
9
App Compromise DefenseExtended InjecAgent App Compromise
ASR0.00e+0
9
App Data Stealing DefenseExtended InjecAgent App Data Stealing
ASR0.00e+0
9
Indirect Prompt Injection DefenseAgentDojo
Banking Defense Rate0.00e+0
3
Multi-agent Task FulfillmentAWS Benchmark Travel scenario
User GSR78.79
2
Multi-agent Task FulfillmentAWS Benchmark Mortgage scenario
Goal Success Rate (User)62.07
2
Showing 7 of 7 rows

Other info

Follow for update