An Engorgio Prompt Makes Large Language Model Babble on
About
Auto-regressive large language models (LLMs) have yielded impressive performance in many real-world tasks. However, the new paradigm of these LLMs also exposes novel threats. In this paper, we explore their vulnerability to inference cost attacks, where a malicious user crafts Engorgio prompts to intentionally increase the computation cost and latency of the inference process. We design Engorgio, a novel methodology, to efficiently generate adversarial Engorgio prompts to affect the target LLM's service availability. Engorgio has the following two technical contributions. (1) We employ a parameterized distribution to track LLMs' prediction trajectory. (2) Targeting the auto-regressive nature of LLMs' inference process, we propose novel loss functions to stably suppress the appearance of the <EOS> token, whose occurrence will interrupt the LLM's generation process. We conduct extensive experiments on 13 open-sourced LLMs with parameters ranging from 125M to 30B. The results show that Engorgio prompts can successfully induce LLMs to generate abnormally long outputs (i.e., roughly 2-13$\times$ longer to reach 90%+ of the output length limit) in a white-box scenario, and our real-world experiment demonstrates Engorgio's threat to LLM services with limited computing resources. The code is released at: https://github.com/jianshuod/Engorgio-prompt.
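To make contribution (2) concrete, the following is a minimal sketch of what an <EOS>-suppression objective could look like: it averages the probability the model assigns to <EOS> over all output positions, so that minimizing it discourages early termination. The function names (`softmax`, `eos_suppression_loss`), the toy logits, and the exact form of the loss are illustrative assumptions, not the implementation released with the paper.

```python
import math


def softmax(logits):
    """Numerically stable softmax over a list of raw logits."""
    peak = max(logits)
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def eos_suppression_loss(logits_per_step, eos_id):
    """Mean probability assigned to the EOS token across output positions.

    Hypothetical objective: a lower value means the model is less likely
    to stop generating at any position.
    """
    eos_probs = [softmax(step)[eos_id] for step in logits_per_step]
    return sum(eos_probs) / len(eos_probs)


# Toy example: vocabulary of 4 tokens, EOS at index 0, two output positions.
neutral = [[0.0, 0.0, 0.0, 0.0], [0.0, 0.0, 0.0, 0.0]]
suppressed = [[-5.0, 0.0, 0.0, 0.0], [-5.0, 0.0, 0.0, 0.0]]
print(eos_suppression_loss(neutral, eos_id=0))
print(eos_suppression_loss(suppressed, eos_id=0))
```

In an actual attack, the per-position logits would come from the target model and the loss would be differentiated with respect to the prompt parameters; the sketch covers only the scalar objective.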
Related benchmarks
| Task | Dataset | Metric | Result | Rank |
|---|---|---|---|---|
| Reasoning Token Induction | Mixed Prompts (SimpleQA, SimpleBench, AIME2024, etc.) (test) | Mean Completion Tokens | 731 | 31 |
| LLM Attack Effectiveness | Qwen3-8B serving environment | TTFT (s) | 0.12 | 6 |
| Inference Cost Attack | Alpaca Vicuna-7B (test) | Average Length | 853 | 6 |
| LLM Attack Effectiveness | Gemma3 12B-it | TTFT (s) | 0.13 | 6 |
| LLM Attack Effectiveness | DeepSeek-R1-Distill-Llama-8B serving environment | TTFT (s) | 0.1 | 6 |
| Inference Cost Attack | Alpaca Samantha-7B (test) | Average Length | 1.15e+3 | 6 |
| Inference Cost Attack | Alpaca Llama2-7B (test) | Average Length | 1.86e+3 | 6 |