TrojanRobot: Physical-world Backdoor Attacks Against VLM-based Robotic Manipulation
About
Robotic manipulation in the physical world is increasingly empowered by \textit{large language models} (LLMs) and \textit{vision-language models} (VLMs), leveraging their understanding and perception capabilities. Recently, various attacks against such robotic policies have been proposed, with backdoor attacks drawing considerable attention for their high stealth and strong persistence capabilities. However, existing backdoor efforts are limited to simulators and suffer from physical-world realization. To address this, we propose \textit{TrojanRobot}, a highly stealthy and broadly effective robotic backdoor attack in the physical world. Specifically, we introduce a module-poisoning approach by embedding a backdoor module into the modular robotic policy, enabling backdoor control over the policy's visual perception module thereby backdooring the entire robotic policy. Our vanilla implementation leverages a backdoor-finetuned VLM to serve as the backdoor module. To enhance its generalization in physical environments, we propose a prime implementation, leveraging the LVLM-as-a-backdoor paradigm and developing three types of prime attacks, \ie, \textit{permutation}, \textit{stagnation}, and \textit{intentional} attacks, thus achieving finer-grained backdoors. Extensive experiments on the UR3e manipulator with 18 task instructions using robotic policies based on four VLMs demonstrate the broad effectiveness and physical-world stealth of TrojanRobot. Our attack's video demonstrations are available via a github link https://trojanrobot.github.io.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Robotic Manipulation (Long) | LIBERO Long 1.0 (test) | SR89.3 | 20 | |
| Robotic Manipulation (Object) | LIBERO-Object 1.0 (test) | Success Rate (SR)89.4 | 20 | |
| Robotic Manipulation (Goal) | LIBERO-Goal 1.0 (test) | SR0.881 | 20 | |
| Robotic Manipulation (Spatial) | LIBERO-Spatial 1.0 (test) | SR90.1 | 20 | |
| Button Pressing | VLA Backdoor Evaluation Tasks | Success Rate0.913 | 20 | |
| Drawer Opening | VLA Backdoor | Success Rate (SR)88.2 | 20 | |
| Peg Insertion | VLA Backdoor Evaluation Tasks | SR92.6 | 20 | |
| Tennis Pushing | VLA Backdoor Evaluation Tasks | Success Rate90.7 | 20 | |
| Pick-&-Place | VLA Backdoor Evaluation Tasks | SR0.891 | 20 |