StealthRank: LLM Ranking Manipulation via Stealthy Prompt Optimization
About
The integration of large language models (LLMs) into information retrieval systems introduces new attack surfaces, particularly for adversarial ranking manipulations. We present StealthRank, a novel adversarial attack method that manipulates LLM-driven ranking systems while maintaining textual fluency and stealth. Unlike existing methods that often introduce detectable anomalies, StealthRank employs an energy-based optimization framework combined with Langevin dynamics to generate StealthRank Prompts (SRPs): adversarial text sequences embedded within item or document descriptions that subtly yet effectively influence LLM ranking mechanisms. We evaluate StealthRank across multiple LLMs, demonstrating its ability to covertly boost the ranking of target items while avoiding explicit manipulation traces. Our results show that StealthRank consistently outperforms state-of-the-art adversarial ranking baselines in both effectiveness and stealth, highlighting critical vulnerabilities in LLM-driven ranking systems. Our code is publicly available at https://github.com/Tangyiming205069/controllable-seo.
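For readers unfamiliar with the optimization technique named above, the following is a minimal, generic sketch of Langevin dynamics on a toy energy function. It is not the authors' implementation: the abstract does not specify the energy terms, so the two quadratic terms, the function names (`energy`, `energy_grad`, `langevin_minimize`), the continuous two-dimensional state, and all hyperparameters here are illustrative assumptions. The sketch only shows the general update rule, a gradient step on the energy plus Gaussian noise whose scale is annealed over time.

```python
import math
import random


def energy(x, targets, weights):
    """Toy energy (assumption): weighted sum of squared distances to each target.

    Each (weight, target) pair is a hypothetical stand-in for one competing
    objective; the paper's actual energy terms are not given in the abstract.
    """
    return sum(
        w * sum((xi - ti) ** 2 for xi, ti in zip(x, t))
        for w, t in zip(weights, targets)
    )


def energy_grad(x, targets, weights):
    """Analytic gradient of the toy energy with respect to x."""
    return [
        sum(2.0 * w * (x[i] - t[i]) for w, t in zip(weights, targets))
        for i in range(len(x))
    ]


def langevin_minimize(x0, targets, weights, steps=500, step_size=0.01,
                      noise_decay=0.99, seed=0):
    """Run noisy gradient descent (Langevin dynamics) with annealed noise.

    Update: x <- x - step_size * grad E(x) + sigma_k * N(0, I),
    where sigma_k = sqrt(2 * step_size) * noise_decay**k shrinks over time,
    so the iterate settles near a low-energy point.
    """
    rng = random.Random(seed)
    x = list(x0)
    for k in range(steps):
        grad = energy_grad(x, targets, weights)
        sigma = math.sqrt(2.0 * step_size) * (noise_decay ** k)
        x = [
            xi - step_size * gi + sigma * rng.gauss(0.0, 1.0)
            for xi, gi in zip(x, grad)
        ]
    return x


if __name__ == "__main__":
    # Two hypothetical objectives pulling toward different points.
    targets = [[1.0, 1.0], [0.0, 0.0]]
    weights = [1.0, 0.5]
    start = [3.0, -2.0]
    final = langevin_minimize(start, targets, weights)
    print("start energy:", energy(start, targets, weights))
    print("final energy:", energy(final, targets, weights))
```

In this toy setting the minimizer is the weighted average of the targets, so the iterate drifts toward a compromise between the two objectives. Generating text with this kind of procedure would additionally require a differentiable relaxation of token sequences and model-derived energy terms, which this sketch does not attempt.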
Related benchmarks
| Task | Dataset | Metric | Value | Rank |
|---|---|---|---|---|
| Output Ranking | ProductBench Home & Kitchen | Top-5 Accuracy | 75 | 28 |
| Promotion Success | Electronics | Top-5 Accuracy | 0.74 | 28 |
| Promotion Success Rate | Tools & Home Improvement | Top-5 Accuracy | 77 | 28 |
| Product Ranking | Product categories dataset | Avg Rank Change | -0.73 | 4 |