Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Predictive but Not Plannable: RC-aux for Latent World Models

About

A latent world model may achieve accurate short-horizon prediction while still inducing a latent space that is poorly aligned with planning. A key issue is spatiotemporal mismatch: these models are often trained with local predictive supervision, but deployed for long-horizon goal-directed search in latent spaces where Euclidean distance may not reflect what is reachable within a finite action budget. We present the Reachability-Correction auxiliary objective (RC-aux), a lightweight correction for this mismatch in reconstruction-free latent world models. RC-aux keeps the world-model backbone unchanged and adds planning-aligned supervision along two axes. Along the time axis, multi-horizon open-loop prediction trains the model beyond one-step consistency. Along the space axis, budget-conditioned reachability supervision, together with temporal hard negatives, encourages the latent space to distinguish states that are eventually reachable from those reachable within the current planning horizon. At test time, the learned reachability signal can also be used by a reachability-aware planner to favor trajectories that are both goal-directed and attainable under the available budget. We instantiate RC-aux on LeWorldModel and evaluate it under both continuation-training and matched-from-scratch settings. Across goal-conditioned pixel-control tasks and a LIBERO-Goal extension, RC-aux improves LeWM-style planning with modest additional cost. These results suggest that planning with latent world models depends not only on predictive accuracy, but also on whether the learned representation encodes the temporal and geometric structure required by downstream search. The code is available at https://github.com/Guang000/RC-aux.

Wenyuan Li, Guang Li, Keisuke Maeda, Takahiro Ogawa, Miki Haseyama• 2026

Related benchmarks

TaskDatasetResultRank
Visuomotor ControlPush T
Success Rate90.8
21
Pixel-based goal-conditioned controlCube
Success Rate76
8
Pixel-based goal-conditioned controlTwoRoom
Success Rate98
8
Pixel-based goal-conditioned controlreacher
Success Rate87.2
5
Pixel-based goal-conditioned controlWall
Success Rate83.6
3
Showing 5 of 5 rows

Other info

Follow for update