Real-Time Grasp Detection Using Convolutional Neural Networks
About
We present an accurate, real-time approach to robotic grasp detection based on convolutional neural networks. Our network performs single-stage regression to graspable bounding boxes without using standard sliding window or region proposal techniques. The model outperforms state-of-the-art approaches by 14 percentage points and runs at 13 frames per second on a GPU. Our network can simultaneously perform classification so that in a single step it recognizes the object and finds a good grasp rectangle. A modification to this model predicts multiple grasps per object by using a locally constrained prediction mechanism. The locally constrained model performs significantly better, especially on objects that can be grasped in a variety of ways.
Joseph Redmon, Anelia Angelova• 2014
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Grasp Detection | Cornell Dataset (object-wise) | Accuracy87.1 | 39 | |
| Grasp Detection | Cornell Dataset image-wise | Accuracy88 | 25 | |
| Grasp Detection | Cornell image-wise | Accuracy88 | 24 | |
| Grasp Detection | Cornell Grasping Dataset (Image-wise split) | Detection Accuracy88 | 17 | |
| Robotic Grasp Detection | Cornell Grasp Dataset (Object-wise) | Accuracy87.1 | 14 | |
| Grasp Detection | Cornell Grasping Dataset (Object-wise split) | -- | 8 | |
| Grasp Prediction | Cornell Grasping Dataset | Speed (fps)3.31 | 5 | |
| Object Classification | Cornell Dataset image-wise | Accuracy90 | 2 | |
| Object Classification | Cornell Dataset (object-wise) | Accuracy61.5 | 2 |
Showing 9 of 9 rows