Goal-Auxiliary Actor-Critic for 6D Robotic Grasping with Point Clouds
About
6D robotic grasping beyond top-down bin-picking scenarios is a challenging task. Previous solutions based on 6D grasp synthesis with robot motion planning usually operate in an open-loop setting, which are sensitive to grasp synthesis errors. In this work, we propose a new method for learning closed-loop control policies for 6D grasping. Our policy takes a segmented point cloud of an object from an egocentric camera as input, and outputs continuous 6D control actions of the robot gripper for grasping the object. We combine imitation learning and reinforcement learning and introduce a goal-auxiliary actor-critic algorithm for policy learning. We demonstrate that our learned policy can be integrated into a tabletop 6D grasping system and a human-robot handover system to improve the grasping performance of unseen objects. Our videos and code can be found at https://sites.google.com/view/gaddpg .
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Human-to-Robot Handover | HandoverSim Simultaneous setting s0 (test) | Success Rate (%)54.86 | 15 | |
| Human-to-Robot Handover | HandoverSim s0 (Sequential) v1 (test) | Success Rate54.76 | 13 | |
| Robot Handover | HandoverSim Sequential (s0) | Success Rate (S)54.76 | 13 | |
| Human-to-Robot Handover | GenH2R-Sim t1 v1 (test) | Success Rate46.7 | 12 | |
| Robot Handover | HandoverSim Simultaneous (s0) | Success Rate (S)44.68 | 12 | |
| Robot Handover | GenH2R-Sim (t1) | Success Rate (%)46.7 | 12 | |
| Human-to-Robot Handover | GenH2R-Sim t0 v1 (test) | Success Score (S)24.05 | 12 | |
| Robot Handover | GenH2R-Sim (t0) | Success Rate24.05 | 12 | |
| 6-DoF Grasping | YCB (held-out objects) | Success Rate88.2 | 10 | |
| 6-DoF Grasping | ShapeNet (SN) (held-out objects) | SR (%)91.3 | 10 |