Efficient Heatmap-Guided 6-Dof Grasp Detection in Cluttered Scenes
About
Fast and robust object grasping in clutter is a crucial component of robotics. Most current works resort to the whole observed point cloud for 6-Dof grasp generation, ignoring the guidance information excavated from global semantics, thus limiting high-quality grasp generation and real-time performance. In this work, we show that the widely used heatmaps are underestimated in the efficiency of 6-Dof grasp generation. Therefore, we propose an effective local grasp generator combined with grasp heatmaps as guidance, which infers in a global-to-local semantic-to-point way. Specifically, Gaussian encoding and the grid-based strategy are applied to predict grasp heatmaps as guidance to aggregate local points into graspable regions and provide global semantic information. Further, a novel non-uniform anchor sampling mechanism is designed to improve grasp accuracy and diversity. Benefiting from the high-efficiency encoding in the image space and focusing on points in local graspable regions, our framework can perform high-quality grasp detection in real-time and achieve state-of-the-art results. In addition, real robot experiments demonstrate the effectiveness of our method with a success rate of 94% and a clutter completion rate of 100%. Our code is available at https://github.com/THU-VCLab/HGGD.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Grasp Detection | GraspNet-1Billion (RealSense) | AP (Average)44.24 | 32 | |
| Grasp Detection | GraspNet-1Billion RealSense Novel | AP24.59 | 25 | |
| Grasp Detection | GraspNet-1Billion RealSense (Seen) | AP64.45 | 25 | |
| Grasp Detection | GraspNet-1Billion RealSense Similar | AP0.5359 | 25 | |
| Grasp Detection | GraspNet-1Billion Kinect camera (seen) | AP61.17 | 23 | |
| Grasp Detection | GraspNet-1Billion Kinect camera (Novel) | AP19.37 | 13 | |
| Grasp Detection | GraspNet-1Billion Kinect camera (Similar split) | AP28.77 | 13 | |
| Grasp Pose Detection | GraspNet-1Billion Kinect 1.0 | AP (Seen)60.26 | 12 | |
| Grasp Detection | GraspNet-1Billion Kinect | AP (Seen)0.6026 | 9 | |
| 6-Dof Grasp Detection | TS-ACRONYM (test) | CFR98.2 | 6 |