PerFACT: Motion Policy with LLM-Powered Dataset Synthesis and Fusion Action-Chunking Transformers
About
Deep learning methods have significantly enhanced motion planning for robotic manipulators by leveraging prior experiences within planning datasets. However, state-of-the-art neural motion planners are primarily trained on small datasets collected in manually generated workspaces, limiting their generalizability to out-of-distribution scenarios. Additionally, these planners often rely on monolithic network architectures that struggle to encode critical planning information. To address these challenges, we introduce Motion Policy with Dataset Synthesis powered by large language models (LLMs) and Fusion Action-Chunking Transformers (PerFACT), which incorporates two key components. Firstly, a novel LLM-powered workspace generation method, MotionGeneralizer, enables large-scale planning data collection by producing a diverse set of semantically feasible workspaces. Secondly, we introduce Fusion Motion Policy Networks (MpiNetsFusion), a generalist neural motion planner that uses a fusion action-chunking transformer to better encode planning signals and attend to multiple feature modalities. Leveraging MotionGeneralizer, we collect 3.5M trajectories to train and evaluate MpiNetsFusion against state-of-the-art planners, which shows that the proposed MpiNetsFusion can plan several times faster on the evaluated tasks.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Motion Planning | Held-out Planning Tasks Box (test) | T/T_Fusion1 | 5 | |
| Motion Planning | Held-out Planning Tasks Shelf II (test) | T/T_Fusion1 | 5 | |
| Robotic Motion Planning | Shelf Task III (held-out) | Success Rate (%)33.4 | 5 | |
| Motion Planning | Held-out Planning Tasks Average (test) | Success Rate51.2 | 5 | |
| Robotic Motion Planning | TableTop (held-out) | Success Rate (%)58 | 5 | |
| Robotic Motion Planning | Bins (held-out) | Success Rate (%)84.5 | 5 | |
| Robotic Motion Planning | Shelf Task II (held-out) | Success Rate34.6 | 5 | |
| Robotic Motion Planning | Box (held-out) | Success Rate61.3 | 5 | |
| Motion Planning | Scene OOD environment generated by MotionGeneralizer (test) | Success Rate28 | 5 | |
| Robotic Motion Planning | Shelf Task I (held-out) | Success Rate (%)35.7 | 5 |