Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Motion Before Action: Diffusing Object Motion as Manipulation Condition

About

Inferring object motion representations from observations enhances the performance of robotic manipulation tasks. This paper introduces a new paradigm for robot imitation learning that generates action sequences by reasoning about object motion from visual observations. We propose MBA (Motion Before Action), a novel module that employs two cascaded diffusion processes for object motion generation and robot action generation under object motion guidance. MBA first predicts the future pose sequence of the object based on observations, then uses this sequence as a condition to guide robot action generation. Designed as a plug-and-play component, MBA can be flexibly integrated into existing robotic manipulation policies with diffusion action heads. Extensive experiments in both simulated and real-world environments demonstrate that our approach substantially improves the performance of existing policies across a wide range of manipulation tasks. Project page: https://selen-suyue.github.io/MBApage/

Yue Su, Xinyu Zhan, Hongjie Fang, Yong-Lu Li, Cewu Lu, Lixin Yang• 2024

Related benchmarks

TaskDatasetResultRank
Robot ManipulationMetaWorld, Adroit, and Dexart Combined
Average Success Rate58.9
25
Robotic Arm ManipulationMetaWorld Very Hard
Success Rate62.7
21
Robot ManipulationDexArt
Success Rate57
20
Robotic Manipulation SimulationMetaWorld hard
Success Rate52.8
6
Robotic Manipulation SimulationAdroit
Success Rate63.2
6
Showing 5 of 5 rows

Other info

Follow for update