Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Latent Diffusion Planning for Imitation Learning

About

Recent progress in imitation learning has been enabled by policy architectures that scale to complex visuomotor tasks, multimodal distributions, and large datasets. However, these methods often rely on learning from large amount of expert demonstrations. To address these shortcomings, we propose Latent Diffusion Planning (LDP), a modular approach consisting of a planner which can leverage action-free demonstrations, and an inverse dynamics model which can leverage suboptimal data, that both operate over a learned latent space. First, we learn a compact latent space through a variational autoencoder, enabling effective forecasting of future states in image-based domains. Then, we train a planner and an inverse dynamics model with diffusion objectives. By separating planning from action prediction, LDP can benefit from the denser supervision signals of suboptimal and action-free data. On simulated visual robotic manipulation tasks, LDP outperforms state-of-the-art imitation learning approaches, as they cannot leverage such additional data.

Amber Xie, Oleh Rybkin, Dorsa Sadigh, Chelsea Finn• 2025

Related benchmarks

TaskDatasetResultRank
Robotic ManipulationRobomimic Can
Success Rate98
30
Robotic ManipulationRobomimic Lift
Success Rate100
28
Robotic ManipulationRobomimic Square
Success Rate83
26
Showing 3 of 3 rows

Other info

Follow for update