Real-Time Grasp Detection Using Convolutional Neural Networks

About

We present an accurate, real-time approach to robotic grasp detection based on convolutional neural networks. Our network performs single-stage regression to graspable bounding boxes without using standard sliding window or region proposal techniques. The model outperforms state-of-the-art approaches by 14 percentage points and runs at 13 frames per second on a GPU. Our network can simultaneously perform classification so that in a single step it recognizes the object and finds a good grasp rectangle. A modification to this model predicts multiple grasps per object by using a locally constrained prediction mechanism. The locally constrained model performs significantly better, especially on objects that can be grasped in a variety of ways.

Joseph Redmon, Anelia Angelova• 2014

Related benchmarks

Task	Dataset	Result
Grasp Detection	Cornell Dataset (object-wise)	Accuracy87.1	39
Grasp Detection	Cornell Dataset image-wise	Accuracy88	25
Grasp Detection	Cornell image-wise	Accuracy88	24
Grasp Detection	Cornell Grasping Dataset (Image-wise split)	Detection Accuracy88	17
Robotic Grasp Detection	Cornell Grasp Dataset (Object-wise)	Accuracy87.1	14
Grasp Detection	Cornell Grasping Dataset (Object-wise split)	--	8
Grasp Prediction	Cornell Grasping Dataset	Speed (fps)3.31	5
Object Classification	Cornell Dataset image-wise	Accuracy90	2
Object Classification	Cornell Dataset (object-wise)	Accuracy61.5	2

Showing 9 of 9 rows

Other info

Follow for update

@wizwand_team Discord