Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

GSAM: A Generalizable and Safe Robotic Framework for Articulated Object Manipulation

About

Articulated object manipulation is a unique challenge for service robots. Existing methods employ end-to-end policy learning, visionmotion planning, and large-language/visual-language model (LLM/VLM), but often overlook the diversity of articulated objects and the complexity of interactions between end-effector and handle, leading to limited generalization and destructive collisions. To address this, we propose GSAM, a generalizable and safe robotic framework for articulated object manipulation. Specifically, a vision-based perceiver generates the kinematic parameters. Considering that pre-trained markers in perceiver yield raw estimations that may deviate from commonsense, we present a f ine-tuned VLM-based refiner, using chain-of-thought (COT) commonsense reasoning to refine perception. To prevent destructive collisions, we design an interaction constraint function generator, integrating articulated object, interaction pose, and obstacle avoidance knowledge into a base. LLM then functionalize these constraints and apply them to trajectory and posture planning. A kinematic-aware manipulation planner verifies reachability for trajectory and posture. Experiments on 50 hinge tasks across 5 object categories and 50 randomly initialized end-effectorhandle configurations show that GSAM reduces standard deviation by 3.1% and improves manipulation success rate by 36.0% compared to the best baseline, respectively demonstrating the superior object generalization and interaction safety of GSAM in practical scenarios.

Beichen Shao, Mengying Xie, Heng Su, Wanyi Zhang, Mingyan Li, Yan Ding, Fausto Giunchiglia, Chao Chen• 2026

Related benchmarks

TaskDatasetResultRank
Articulated Object ManipulationReal-robot manipulation trials Right Hinge
OSR80
9
Articulated Object ManipulationReal-robot manipulation trials Prismatic Hinge
OSR100
9
Articulated Object ManipulationReal-robot manipulation trials Left Hinge
OSR90
9
Articulated Object ManipulationReal-robot manipulation trials Textured Hinge
OSR80
9
Articulated Object ManipulationReal-robot manipulation trials Mean across 50 tasks
Overall Success Rate (OSR)88
9
Articulated Object Manipulation50 tasks in campus environments
Right Hinge Time (s)35.4
9
Articulated Object ManipulationReal-robot manipulation trials Bottom Hinge
OSR90
8
Articulated Object Axis EstimationCampus-scale 50 tasks (test)
Right Hinge Axis EA-Score82
4
Articulated Object SegmentationCampus-scale 50 tasks (test)
Right Hinge Mask IoU79.8
3
Showing 9 of 9 rows

Other info

Follow for update