Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Visual Imitation Learning of Task-Oriented Object Grasping and Rearrangement

About

Task-oriented object grasping and rearrangement are critical skills for robots to accomplish different real-world manipulation tasks. However, they remain challenging due to partial observations of the objects and shape variations in categorical objects. In this paper, we propose the Multi-feature Implicit Model (MIMO), a novel object representation that encodes multiple spatial features between a point and an object in an implicit neural field. Training such a model on multiple features ensures that it embeds the object shapes consistently in different aspects, thus improving its performance in object shape reconstruction from partial observation, shape similarity measure, and modeling spatial relations between objects. Based on MIMO, we propose a framework to learn task-oriented object grasping and rearrangement from single or multiple human demonstration videos. The evaluations in simulation show that our approach outperforms the state-of-the-art methods for multi- and single-view observations. Real-world experiments demonstrate the efficacy of our approach in one- and few-shot imitation learning of manipulation tasks.

Yichen Cai, Jianfeng Gao, Christoph Pohl, Tamim Asfour• 2024

Related benchmarks

TaskDatasetResultRank
Pick-&-PlaceUnseen Mug T1
Grasp Success Rate100
10
Pick-&-PlaceUnseen Bowl T2
Grasp Success Rate100
10
Pick-&-PlaceUnseen Bottle T3
Grasp Success Rate97
10
Pick-&-PlaceMean Across T1, T2, T3
Mean Grasp Success Rate99
10
Hanging a mug on the hook of a rack (T4)R-NDF simulation Upright pose
Success Rate92
9
Hanging a mug on the hook of a rack (T4)R-NDF simulation (Arbitrary pose)
Success Rate (T4)92
9
Placing a bottle in a container (T6)R-NDF simulation Upright pose
Success Rate91
9
Placing a bottle in a container (T6)R-NDF simulation (Arbitrary pose)
Success Rate93
9
Placing a bowl on a mug (T5)R-NDF simulation Upright pose
Success Rate92
9
Placing a bowl on a mug (T5)R-NDF simulation (Arbitrary pose)
Success Rate91
9
Showing 10 of 15 rows

Other info

Follow for update