Graspness Discovery in Clutters for Fast and Accurate Grasp Detection

About

Efficient and robust grasp pose detection is vital for robotic manipulation. For general 6 DoF grasping, conventional methods treat all points in a scene equally and usually adopt uniform sampling to select grasp candidates. However, we discover that ignoring where to grasp greatly harms the speed and accuracy of current grasp pose detection methods. In this paper, we propose "graspness", a quality based on geometry cues that distinguishes graspable areas in cluttered scenes. A look-ahead searching method is proposed for measuring the graspness and statistical results justify the rationality of our method. To quickly detect graspness in practice, we develop a neural network named cascaded graspness model to approximate the searching process. Extensive experiments verify the stability, generality and effectiveness of our graspness model, allowing it to be used as a plug-and-play module for different methods. A large improvement in accuracy is witnessed for various previous methods after equipping our graspness model. Moreover, we develop GSNet, an end-to-end network that incorporates our graspness model for early filtering of low-quality predictions. Experiments on a large-scale benchmark, GraspNet-1Billion, show that our method outperforms previous arts by a large margin (30+ AP) and achieves a high inference speed. The library of GSNet has been integrated into AnyGrasp, which is at https://github.com/graspnet/anygrasp_sdk.

Chenxi Wang, Hao-Shu Fang, Minghao Gou, Hongjie Fang, Jin Gao, Cewu Lu• 2024

Related benchmarks

Task	Dataset	Result
Grasp Detection	GraspNet-1Billion RealSense (Seen)	AP67.12	33
Grasp Detection	GraspNet-1Billion RealSense Similar	AP0.5481	33
Grasp Detection	GraspNet-1Billion RealSense Novel	AP24.31	33
Grasp Detection	GraspNet-1Billion (RealSense)	AP (Average)48.75	32
Grasp Detection	GraspNet-1Billion Kinect camera (seen)	AP63.5	23
Clutter removal	Packed scenes single-view, fixed camera, gamma noise	GSR67.8	16
Clutter removal	Pile scenes single-view, fixed camera, gamma noise	GSR58.3	16
Grasp Detection	GraspNet-1Billion Kinect camera (Novel)	AP19.78	13
Grasp Detection	GraspNet-1Billion Kinect camera (Similar split)	AP29.61	13
Grasp Pose Detection	GraspNet-1Billion Kinect 1.0	AP (Seen)63.5	12

Showing 10 of 22 rows

Other info

Code

Follow for update

@wizwand_team Discord