Tra-MoE: Learning Trajectory Prediction Model from Multiple Domains for Adaptive Policy Conditioning

About

Learning from multiple domains is a primary factor that influences the generalization of a single unified robot system. In this paper, we aim to learn a trajectory prediction model from broad out-of-domain data to improve its performance and generalization ability. The trajectory model is designed to predict any-point trajectories in the current frame given an instruction, and can provide detailed control guidance for robotic policy learning. To handle the diverse out-of-domain data distribution, we propose a sparsely-gated MoE architecture (with a Top-1 gating strategy) for the trajectory model, which we call Tra-MoE. The sparse activation design enables a good balance between parameter cooperation and specialization, effectively benefiting from large-scale out-of-domain data while maintaining constant FLOPs per token. In addition, we introduce an adaptive policy conditioning technique that learns 2D mask representations for predicted trajectories; these are explicitly aligned with image observations to guide action prediction more flexibly. We perform extensive experiments in both simulation and real-world scenarios to verify the effectiveness of Tra-MoE and the adaptive policy conditioning technique. We also conduct a comprehensive empirical study on training Tra-MoE, demonstrating that Tra-MoE consistently outperforms the dense baseline model, even when the latter is scaled to match Tra-MoE's parameter count.
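
To make the Top-1 gating idea above concrete, here is a minimal sketch of a sparsely-gated MoE layer in plain Python. It is not the authors' implementation: the class name Top1MoE, the use of a single linear map per expert, the random initialization, and the scaling of the expert output by its gate probability are all assumptions chosen for illustration. It only shows the general routing pattern, where each token is sent to exactly one expert, so per-token compute does not grow with the number of experts.

```python
import math
import random


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def matvec(matrix, vec):
    """Multiply a (rows x cols) matrix by a vector of length cols."""
    return [sum(w * x for w, x in zip(row, vec)) for row in matrix]


class Top1MoE:
    """Hypothetical Top-1 MoE layer; each expert is one linear map (a simplification)."""

    def __init__(self, dim, num_experts, seed=0):
        rng = random.Random(seed)

        def init(rows, cols):
            return [[rng.gauss(0.0, 0.1) for _ in range(cols)] for _ in range(rows)]

        self.router = init(num_experts, dim)  # one row of routing weights per expert
        self.experts = [init(dim, dim) for _ in range(num_experts)]

    def forward(self, tokens):
        outputs, choices = [], []
        for tok in tokens:
            probs = softmax(matvec(self.router, tok))
            # Top-1 gating: keep only the highest-probability expert for this token.
            k = max(range(len(probs)), key=probs.__getitem__)
            expert_out = matvec(self.experts[k], tok)
            # Assumed design: scale the output by the gate probability.
            outputs.append([probs[k] * y for y in expert_out])
            choices.append(k)
        return outputs, choices


moe = Top1MoE(dim=4, num_experts=3)
tokens = [[1.0, 0.0, 0.0, 0.0], [0.0, 1.0, 0.0, 0.0]]
outputs, choices = moe.forward(tokens)
```

Only one expert's weights are touched per token, so adding experts increases the parameter count but not the per-token work in the expert computation; the router cost grows only with the number of experts.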

Jiange Yang, Haoyi Zhu, Yating Wang, Gangshan Wu, Tong He, Limin Wang • 2024

Related benchmarks

Task	Dataset	Result
Robot Manipulation	LIBERO	Object Achievement 77	1025
Robotic Manipulation	LIBERO-Goal (test)	Success Rate 78	19
Text-conditioned trajectory prediction	LIBERO-90	Side MSE 39.77	8
Text-conditioned trajectory prediction	LIBERO-10	Side MSE 50.37	8
Trajectory Prediction	LIBERO-10	MSE (Side) 40.54	8
Robotic Manipulation	LIBERO Spatial (test)	Success Rate (SR) 73	7
Trajectory Prediction	LIBERO Goal	Side MSE 27.56	4
Trajectory Prediction	LIBERO Object	MSE (Side) 14.07	4
Trajectory Prediction	LIBERO Spatial	Side MSE 37.62	4
