SE(3)-Equivariant Relational Rearrangement with Neural Descriptor Fields
About
We present a method for performing tasks involving spatial relations between novel object instances initialized in arbitrary poses directly from point cloud observations. Our framework provides a scalable way for specifying new tasks using only 5-10 demonstrations. Object rearrangement is formalized as the question of finding actions that configure task-relevant parts of the object in a desired alignment. This formalism is implemented in three steps: assigning a consistent local coordinate frame to the task-relevant object parts, determining the location and orientation of this coordinate frame on unseen object instances, and executing an action that brings these frames into the desired alignment. We overcome the key technical challenge of determining task-relevant local coordinate frames from a few demonstrations by developing an optimization method based on Neural Descriptor Fields (NDFs) and a single annotated 3D keypoint. An energy-based learning scheme to model the joint configuration of the objects that satisfies a desired relational task further improves performance. The method is tested on three multi-object rearrangement tasks in simulation and on a real robot. Project website, videos, and code: https://anthonysimeonov.github.io/r-ndf/
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Pick-&-Place | Unseen Mug T1 | Grasp Success Rate90 | 10 | |
| Pick-&-Place | Unseen Bowl T2 | Grasp Success Rate90 | 10 | |
| Pick-&-Place | Unseen Bottle T3 | Grasp Success Rate53 | 10 | |
| Pick-&-Place | Mean Across T1, T2, T3 | Mean Grasp Success Rate78 | 10 | |
| Placing a bottle in a container (T6) | R-NDF simulation Upright pose | Success Rate80 | 9 | |
| Hanging a mug on the hook of a rack (T4) | R-NDF simulation Upright pose | Success Rate71 | 9 | |
| Hanging a mug on the hook of a rack (T4) | R-NDF simulation (Arbitrary pose) | Success Rate (T4)55 | 9 | |
| Placing a bottle in a container (T6) | R-NDF simulation (Arbitrary pose) | Success Rate54 | 9 | |
| Placing a bowl on a mug (T5) | R-NDF simulation Upright pose | Success Rate75 | 9 | |
| Placing a bowl on a mug (T5) | R-NDF simulation (Arbitrary pose) | Success Rate75 | 9 |