Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Trajectory World Models for Heterogeneous Environments

About

Heterogeneity in sensors and actuators across environments poses a significant challenge to building large-scale pre-trained world models on top of this low-dimensional sensor information. In this work, we explore pre-training world models for heterogeneous environments by addressing key transfer barriers in both data diversity and model flexibility. We introduce UniTraj, a unified dataset comprising over one million trajectories from 80 environments, designed to scale data while preserving critical diversity. Additionally, we propose TrajWorld, a novel architecture capable of flexibly handling varying sensor and actuator information and capturing environment dynamics in-context. Pre-training TrajWorld on UniTraj yields substantial gains in transition prediction, achieves a new state-of-the-art for off-policy evaluation, and also delivers superior online performance of model predictive control. To the best of our knowledge, this work, for the first time, demonstrates the transfer benefits of world models across heterogeneous and complex control environments. Code and data are available at https://github.com/thuml/TrajWorld.

Shaofeng Yin, Jialong Wu, Siqiao Huang, Xingjian Su, Xu He, Jianye Hao, Mingsheng Long• 2025

Related benchmarks

TaskDatasetResultRank
Downstream model-based controlWalker2d OpenAI Gym (test)
Accumulated Reward1.93e+3
8
Downstream model-based controlGo1 Unitree (test)
Accumulated Reward0.49
8
Downstream model-based controlHopper OpenAI Gym (test)
Accumulated Reward534.3
8
Dynamics PredictionHopper
MAE17.388
4
Trajectory PredictionCassie bipedal jumping (test)
MAE7.834
4
Trajectory PredictionUnitree A1 quadruped locomotion (test)
MAE5.138
4
Trajectory PredictionUR5 tabletop manipulation (test)
MAE8.066
4
Dynamics PredictionWalker2D
MAE22.261
4
Dynamics PredictionFranka
MAE13.102
4
Showing 9 of 9 rows

Other info

Follow for update