
DiffuserLite: Towards Real-time Diffusion Planning

About

Diffusion planning has been recognized as an effective decision-making paradigm in various domains. The capability of generating high-quality long-horizon trajectories makes it a promising research direction. However, existing diffusion planning methods suffer from low decision-making frequencies due to the expensive iterative sampling cost. To alleviate this, we introduce DiffuserLite, a super fast and lightweight diffusion planning framework, which employs a planning refinement process (PRP) to generate coarse-to-fine-grained trajectories, significantly reducing the modeling of redundant information and leading to notable increases in decision-making frequency. Our experimental results demonstrate that DiffuserLite achieves a decision-making frequency of 122.2Hz (112.7x faster than predominant frameworks) and reaches state-of-the-art performance on D4RL, Robomimic, and FinRL benchmarks. In addition, DiffuserLite can also serve as a flexible plugin to increase the decision-making frequency of other diffusion planning algorithms, providing a structural design reference for future works. More details and visualizations are available at https://diffuserlite.github.io/.
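As a rough back-of-envelope reading of the headline numbers, the short sketch below converts the reported 122.2 Hz into a per-decision latency and derives the baseline frequency implied by the 112.7x speedup. The helper names are illustrative, and the implied baseline assumes the speedup is a simple ratio of decision frequencies measured in the same setting, which the abstract does not spell out.

```python
def period_ms(frequency_hz: float) -> float:
    """Time budget per decision, in milliseconds, for a given decision frequency."""
    return 1000.0 / frequency_hz


def implied_baseline_hz(frequency_hz: float, speedup: float) -> float:
    """Baseline frequency implied by a speedup factor (assumes a plain frequency ratio)."""
    return frequency_hz / speedup


# Figures quoted in the abstract.
diffuserlite_hz = 122.2
speedup = 112.7

# Derived quantities (approximate; they depend on the ratio assumption above).
baseline_hz = implied_baseline_hz(diffuserlite_hz, speedup)

print(f"DiffuserLite latency per decision: {period_ms(diffuserlite_hz):.2f} ms")
print(f"Implied baseline frequency:        {baseline_hz:.2f} Hz")
print(f"Implied baseline latency:          {period_ms(baseline_hz):.0f} ms")
```

Under that assumption, a decision takes on the order of 8 ms with DiffuserLite versus roughly 0.9 s for the baseline frameworks, which is the gap between real-time control and roughly one decision per second.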

Zibin Dong, Jianye Hao, Yifu Yuan, Fei Ni, Yitian Wang, Pengyi Li, Yan Zheng • 2024

Related benchmarks

| Task | Dataset | Metric | Result | Rank |
| --- | --- | --- | --- | --- |
| Offline Reinforcement Learning | D4RL Franka Kitchen | Mixed Success Rate | 73.6 | 22 |
| Robotic Manipulation (Square) | Square PH | Success Rate | 81.8 | 16 |
| Robotic Manipulation (Lift) | Lift PH | Success Rate | 100 | 11 |
| Offline Reinforcement Learning | D4RL Medium | HalfCheetah Score | 48.9 | 9 |
| Offline Reinforcement Learning | D4RL Medium-Expert | HalfCheetah Score | 90.8 | 9 |
| Offline Reinforcement Learning | D4RL Medium-Replay | HalfCheetah | 42.9 | 9 |
| Offline Reinforcement Learning | D4RL Antmaze (Play) | AntMaze Medium Score | 88.8 | 8 |
| Offline Reinforcement Learning | D4RL Antmaze-Diverse | AntMaze-Medium | 9.24e+3 | 8 |
| Offline Reinforcement Learning | D4RL MuJoCo locomotion v2 | Runtime (s) | 0.005 | 6 |
| Offline Reinforcement Learning | D4RL Kitchen manipulation v0 | Runtime (s) | 0.01 | 6 |
Showing 10 of 14 rows
