Equivariant Volumetric Grasping
About
We propose a new volumetric grasp model that is equivariant to rotations around the vertical axis, leading to a significant improvement in sampling efficiency. Our model employs a tri-plane volumetric feature representation -- i.e., the projection of 3D features onto three canonical planes. We introduce a novel tri-plane feature design in which features on the horizontal plane are \emph{equivariant} to $90^\circ$ rotations, while the \emph{sum} of features from the other two planes remains \emph{invariant} to reflections induced by the same transformations. We further develop equivariant adaptations of two state-of-the-art volumetric grasp planners, GIGA and IGD. Specifically, we derive a new equivariant formulation of IGD's deformable attention mechanism and propose an equivariant generative model of grasp orientations based on flow matching. We provide a detailed analytical justification of the proposed equivariance properties and validate our approach through extensive simulated and real-world experiments. Our results demonstrate that the proposed projection-based design reduces both computational and memory costs. Moreover, the equivariant grasp models built on top of our tri-plane features consistently outperform their non-equivariant counterparts, achieving higher performance within a real-time cost constraint. Video and code can be viewed in: https://mousecpn.github.io/evg-page/
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Clutter removal | Packed scenes single-view, fixed camera, gamma noise | GSR97.4 | 16 | |
| Clutter removal | Pile scenes single-view, fixed camera, gamma noise | GSR78.6 | 16 | |
| Clutter removal | Packed single-view, random camera pose, Gaussian noise | GSR95.8 | 10 | |
| Clutter removal | Pile single-view, random camera pose, Gaussian noise | GSR90.1 | 10 | |
| Clutter removal | Real-world Packed | GSR89.9 | 7 | |
| Clutter removal | Pile Real-world | GSR (%)79.3 | 7 | |
| Clutter removal | Real-world Adv | GSR88.1 | 7 |