Compositional Monte Carlo Tree Diffusion for Extendable Planning
About
Monte Carlo Tree Diffusion (MCTD) integrates diffusion models with structured tree search to enable effective trajectory exploration through stepwise reasoning. However, MCTD remains fundamentally limited by training trajectory lengths. While periodic replanning allows plan concatenation for longer plan generation, the planning process remains locally confined, as MCTD searches within individual trajectories without access to global context. We propose Compositional Monte Carlo Tree Diffusion (C-MCTD), a framework that elevates planning from individual trajectory optimization to reasoning over complete plan compositions. C-MCTD introduces three complementary components: (1) Online Composer, which performs globally-aware planning by searching across entire plan compositions; (2) Distributed Composer, which reduces search complexity through parallel exploration from multiple starting points; and (3) Preplan Composer, which accelerates inference by leveraging cached plan graphs.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Goal-conditioned locomotion | OGBench PointMaze-Stitch Giant | Success Rate100 | 14 | |
| Long-horizon navigation | OGBench AntMaze-Stitch Giant | Success Rate75 | 5 | |
| Long-horizon navigation | OGBench AntMaze-Stitch Large | Success Rate94 | 4 | |
| Long-horizon navigation | OGBench AntMaze-Stitch Medium | Success Rate98 | 4 |