Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Exploring the Adversarial Vulnerabilities of Vision-Language-Action Models in Robotics

About

Recently in robotics, Vision-Language-Action (VLA) models have emerged as a transformative approach, enabling robots to execute complex tasks by integrating visual and linguistic inputs within an end-to-end learning framework. Despite their significant capabilities, VLA models introduce new attack surfaces. This paper systematically evaluates their robustness. Recognizing the unique demands of robotic execution, our attack objectives target the inherent spatial and functional characteristics of robotic systems. In particular, we introduce two untargeted attack objectives that leverage spatial foundations to destabilize robotic actions, and a targeted attack objective that manipulates the robotic trajectory. Additionally, we design an adversarial patch generation approach that places a small, colorful patch within the camera's view, effectively executing the attack in both digital and physical environments. Our evaluation reveals a marked degradation in task success rates, with up to a 100\% reduction across a suite of simulated robotic tasks, highlighting critical security gaps in current VLA architectures. By unveiling these vulnerabilities and proposing actionable evaluation metrics, we advance both the understanding and enhancement of safety for VLA-based robotic systems, underscoring the necessity for continuously developing robust defense strategies prior to physical-world deployments.

Taowen Wang, Cheng Han, James Chenhao Liang, Wenhao Yang, Dongfang Liu, Luna Xinyu Zhang, Qifan Wang, Jiebo Luo, Ruixiang Tang• 2024

Related benchmarks

TaskDatasetResultRank
Robot ManipulationLIBERO (test)
Average Success Rate27.4
184
Robot ManipulationLIBERO Object--
70
Robot ManipulationLIBERO Spatial--
41
Robot ManipulationLIBERO simulation
Average Success Rate6.9
36
Robot ManipulationLIBERO Goal--
36
Robotic ManipulationLIBERO-10--
27
Robot ManipulationLIBERO Simulated 1.0 (test)
Spatial Success Rate100
24
Robot ManipulationLIBERO Physical 1.0 (test)
Spatial Success Rate98
24
Robotic ManipulationLIBERO Physical
Spatial Success Rate10.4
9
VLA HijackingInstructVLA 5 tasks
ASR6.77
5
Showing 10 of 13 rows

Other info

Follow for update