Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Hierarchical Planning with Latent World Models

About

Model predictive control (MPC) with learned world models has emerged as a promising paradigm for embodied control, particularly for its ability to generalize zero-shot when deployed in new environments. However, learned world models often struggle with long-horizon control due to the accumulation of prediction errors and the exponentially growing search space. In this work, we address these challenges by learning latent world models at multiple temporal scales and performing hierarchical planning across these scales, enabling long-horizon reasoning while substantially reducing inference-time planning complexity. Our approach serves as a modular planning abstraction that applies across diverse latent world-model architectures and domains. We demonstrate that this hierarchical approach enables zero-shot control on real-world non-greedy robotic tasks, achieving a 70% success rate on pick-&-place using only a final goal specification, compared to 0% for a single-level world model. In addition, across physics-based simulated environments including push manipulation and maze navigation, hierarchical planning achieves higher success while requiring up to 4x less planning-time compute.

Wancong Zhang, Basile Terver, Artem Zholus, Soham Chitnis, Harsh Sutaria, Mido Assran, Randall Balestriero, Amir Bar, Adrien Bardes, Yann LeCun, Nicolas Ballas• 2026

Related benchmarks

TaskDatasetResultRank
DrawerFranka Robot Real-world
Average Success Rate70
11
Robotic ManipulationFranka Pick-&-Place
Cup Success Rate80
8
Maze NavigationDiverse Maze Medium, D ∈ [9, 12] MuJoCo PointMaze (test)
Success Rate95
5
Maze NavigationDiverse Maze Hard, D ∈ [13, 16] MuJoCo PointMaze (test)
Success Rate83
5
PushT Robotic ManipulationPushT (val)
Success Rate (d=25)89
5
Maze NavigationDiverse Maze MuJoCo PointMaze Easy, D ∈ [5, 8] (test)
Success Rate100
5
Showing 6 of 6 rows

Other info

Follow for update