Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Code-as-Monitor: Constraint-aware Visual Programming for Reactive and Proactive Robotic Failure Detection

About

Automatic detection and prevention of open-set failures are crucial in closed-loop robotic systems. Recent studies often struggle to simultaneously identify unexpected failures reactively after they occur and prevent foreseeable ones proactively. To this end, we propose Code-as-Monitor (CaM), a novel paradigm leveraging the vision-language model (VLM) for both open-set reactive and proactive failure detection. The core of our method is to formulate both tasks as a unified set of spatio-temporal constraint satisfaction problems and use VLM-generated code to evaluate them for real-time monitoring. To enhance the accuracy and efficiency of monitoring, we further introduce constraint elements that abstract constraint-related entities or their parts into compact geometric elements. This approach offers greater generality, simplifies tracking, and facilitates constraint-aware visual programming by leveraging these elements as visual prompts. Experiments show that CaM achieves a 28.7% higher success rate and reduces execution time by 31.8% under severe disturbances compared to baselines across three simulators and a real-world setting. Moreover, CaM can be integrated with open-loop control policies to form closed-loop systems, enabling long-horizon tasks in cluttered scenes with dynamic environments.

Enshen Zhou, Qi Su, Cheng Chi, Zhizheng Zhang, Zhongyuan Wang, Tiejun Huang, Lu Sheng, He Wang• 2024

Related benchmarks

TaskDatasetResultRank
Autonomous Driving EvaluationDriveLM VRU-Accident benchmark
GAR62.7
10
Dual-arm pour waterDual-arm pour water Simulation v1 (test)
Success Rate95
10
Single-arm pour waterSingle-arm pour water Simulation v1 (test)
Success Rate100
10
Rearrange tableRearrange table Simulation v1 (test)
Success Rate100
10
Robotic manipulation with disturbancesSetup coffee tray
Success Rate60
5
Autonomous Driving SafetyVRU-Accident (Clean)
GAR62.7
5
Autonomous Driving SafetyVRU-Accident (ADvLM attack)
GAR79.7
5
Autonomous Driving SafetyVRU-Accident CAD attack
GAR77.7
5
Robotic manipulation with disturbancesSingle-arm pour water
Success Rate90
5
Robotic manipulation with disturbancesDual-arm pour water
Success Rate70
5
Showing 10 of 15 rows

Other info

Follow for update