DiffuserLite: Towards Real-time Diffusion Planning
About
Diffusion planning has been recognized as an effective decision-making paradigm in various domains. The capability of generating high-quality long-horizon trajectories makes it a promising research direction. However, existing diffusion planning methods suffer from low decision-making frequencies due to the expensive iterative sampling cost. To alleviate this, we introduce DiffuserLite, a super fast and lightweight diffusion planning framework, which employs a planning refinement process (PRP) to generate coarse-to-fine-grained trajectories, significantly reducing the modeling of redundant information and leading to notable increases in decision-making frequency. Our experimental results demonstrate that DiffuserLite achieves a decision-making frequency of 122.2Hz (112.7x faster than predominant frameworks) and reaches state-of-the-art performance on D4RL, Robomimic, and FinRL benchmarks. In addition, DiffuserLite can also serve as a flexible plugin to increase the decision-making frequency of other diffusion planning algorithms, providing a structural design reference for future works. More details and visualizations are available at https://diffuserlite.github.io/.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Offline Reinforcement Learning | D4RL Franka Kitchen | Mixed Success Rate73.6 | 22 | |
| Robotic Manipulation (Square) | Square PH | Success Rate81.8 | 16 | |
| Robotic Manipulation (Lift) | Lift PH | Success Rate100 | 11 | |
| Offline Reinforcement Learning | D4RL Medium | HalfCheetah Score48.9 | 9 | |
| Offline Reinforcement Learning | D4RL Medium-Expert | HalfCheetah Score90.8 | 9 | |
| Offline Reinforcement Learning | D4RL Medium-Replay | HalfCheetah42.9 | 9 | |
| Offline Reinforcement Learning | D4RL Antmaze (Play) | AntMaze Medium Score88.8 | 8 | |
| Offline Reinforcement Learning | D4RL Antmaze-Diverse | AntMaze-Medium9.24e+3 | 8 | |
| Offline Reinforcement Learning | D4RL MuJoCo locomotion v2 | Runtime (s)0.005 | 6 | |
| Offline Reinforcement Learning | D4RL Kitchen manipulation v0 | Runtime (s)0.01 | 6 |