Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

MG-Grasp: Metric-Scale Geometric 6-DoF Grasping Framework with Sparse RGB Observations

About

Single-view RGB-D grasp detection remains a common choice in 6-DoF robotic grasping systems, which typically requires a depth sensor. While RGB-only 6-DoF grasp methods has been studied recently, their inaccurate geometric representation is not directly suitable for physically reliable robotic manipulation, thereby hindering reliable grasp generation. To address these limitations, we propose MG-Grasp, a novel depth-free 6-DoF grasping framework that achieves high-quality object grasping. Leveraging two-view 3D foundation model with camera intrinsic/extrinsic, our method reconstructs metric-scale and multi-view consistent dense point clouds from sparse RGB images and generates stable 6-DoF grasp. Experiments on GraspNet-1Billion dataset and real world demonstrate that MG-Grasp achieves state-of-the-art (SOTA) grasp performance among RGB-based 6-DoF grasping methods.

Kangxu Wang, Siang Chen, Chenxing Jiang, Shaojie Shen, Yixiang Dai, Guijin Wang• 2026

Related benchmarks

TaskDatasetResultRank
Grasp DetectionGraspNet-1Billion RealSense Similar
AP0.5603
33
Grasp DetectionGraspNet-1Billion RealSense (Seen)
AP63.7
33
Grasp DetectionGraspNet-1Billion RealSense Novel
AP23.22
33
6-DoF GraspingGraspNet-1Billion Average RealSense
AP47.65
8
6-DoF GraspingGraspNet-1Billion Seen Kinect
AP66.8
8
6-DoF GraspingGraspNet-1Billion Similar Kinect
AP57.35
8
6-DoF GraspingGraspNet-1Billion Novel Kinect
AP20.47
8
6-DoF GraspingGraspNet-1Billion Average Kinect
AP48.21
8
6-DoF GraspingGraspNet RealSense Kinect
Time (s)2.86
3
Showing 9 of 9 rows

Other info

Follow for update