Local Manifold Approximation and Projection for Manifold-Aware Diffusion Planning
About
Recent advances in diffusion-based generative modeling have demonstrated significant promise in tackling long-horizon, sparse-reward tasks by leveraging offline datasets. While these approaches have achieved promising results, their reliability remains inconsistent due to the inherent stochastic risk of producing infeasible trajectories, limiting their applicability in safety-critical applications. We identify that the primary cause of these failures is inaccurate guidance during the sampling procedure, and demonstrate the existence of manifold deviation by deriving a lower bound on the guidance gap. To address this challenge, we propose Local Manifold Approximation and Projection (LoMAP), a training-free method that projects the guided sample onto a low-rank subspace approximated from offline datasets, preventing infeasible trajectory generation. We validate our approach on standard offline reinforcement learning benchmarks that involve challenging long-horizon planning. Furthermore, we show that, as a standalone module, LoMAP can be incorporated into the hierarchical diffusion planner, providing further performance enhancements.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Locomotion | D4RL walker2d-medium-expert | Normalized Score109.2 | 63 | |
| Locomotion | D4RL HalfCheetah Medium-Replay | Normalized Score0.391 | 61 | |
| Locomotion | D4RL Walker2d medium | Normalized Score79.9 | 60 | |
| Locomotion | D4RL Halfcheetah medium | Normalized Score45.4 | 60 | |
| Locomotion | D4RL halfcheetah-medium-expert | Normalized Score91.1 | 53 | |
| Offline Reinforcement Learning | D4RL antmaze-large (diverse) | Normalized Score39.3 | 37 | |
| Offline Reinforcement Learning | D4RL antmaze-large (play) | Normalized Score0.207 | 36 | |
| Locomotion | D4RL Hopper medium | Normalized Score93.7 | 30 | |
| Offline Reinforcement Learning | D4RL antmaze-medium-play | Normalized Score40.7 | 23 | |
| Locomotion | D4RL hopper-medium-expert | -- | 18 |