Hierarchical Planning with Latent World Models

About

World models are a promising path to zero-shot embodied control through planning. However, existing world model planners struggle on long-horizon, multi-stage tasks: prediction errors compound and naive search is exponential in the planning horizon. Hierarchy mitigates both by decomposing tasks into shorter, tractable subproblems; yet prior hierarchical approaches either amortize control into task-specific policies (hierarchical RL) or assume low-dimensional states and known dynamics (classical hierarchical MPC). We present Hierarchical Planning with Latent World Models (HWM), an architecture and planning paradigm for hierarchical model predictive control (MPC) directly on visual world models trained solely via next-latent prediction. HWM learns world models at multiple temporal scales within a shared latent space, so predictions from the long-horizon model serve as subgoals for the short-horizon model via latent matching, without task-specific rewards, skill learning, or hierarchical policies. To keep long-horizon search tractable, HWM learns an action encoder that compresses primitive action chunks into latent macro-actions. On real-world Franka manipulation, HWM solves pick-and-place from a single goal image at 70% success vs. 0% for single-level planning. Across simulated push manipulation and maze navigation, HWM consistently improves performance on long-horizon tasks while requiring up to 3x less planning compute.

Wancong Zhang, Basile Terver, Artem Zholus, Soham Chitnis, Harsh Sutaria, Mido Assran, Randall Balestriero, Amir Bar, Adrien Bardes, Yann LeCun, Nicolas Ballas• 2026

Related benchmarks

Task	Dataset	Result
Drawer	Franka Robot Real-world	Average Success Rate70	11
Robotic Manipulation	Franka Pick-&-Place	Cup Success Rate80	8
Maze Navigation	Diverse Maze Medium, D ∈ [9, 12] MuJoCo PointMaze (test)	Success Rate95	5
Maze Navigation	Diverse Maze Hard, D ∈ [13, 16] MuJoCo PointMaze (test)	Success Rate83	5
PushT Robotic Manipulation	PushT (val)	Success Rate (d=25)89	5
Maze Navigation	Diverse Maze MuJoCo PointMaze Easy, D ∈ [5, 8] (test)	Success Rate100	5
Push-T Robot Manipulation	Push-T Short-horizon t=25 13, 5	Success Rate89	5
Push-T Robot Manipulation	Push-T Long-horizon t=75 11	Success Rate61	5

Showing 8 of 8 rows

Other info

Follow for update

@wizwand_team Discord