Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

xTED: Cross-Domain Adaptation via Diffusion-Based Trajectory Editing

About

Reusing pre-collected data from different domains is an appealing solution for decision-making tasks, especially when data in the target domain are limited. Existing cross-domain policy transfer methods mostly aim at learning domain correspondences or corrections to facilitate policy learning, such as learning task/domain-specific discriminators, representations, or policies. This design philosophy often results in heavy model architectures or task/domain-specific modeling, lacking flexibility. This reality makes us wonder: can we directly bridge the domain gaps universally at the data level, instead of relying on complex downstream cross-domain policy transfer procedures? In this study, we propose the Cross-Domain Trajectory EDiting (xTED) framework that employs a specially designed diffusion model for cross-domain trajectory adaptation. Our proposed model architecture effectively captures the intricate dependencies among states, actions, and rewards, as well as the dynamics patterns within target data. Edited by adding noises and denoising with the pre-trained diffusion model, source domain trajectories can be transformed to align with target domain properties while preserving original semantic information. This process effectively corrects underlying domain gaps, enhancing state realism and dynamics reliability in source data, and allowing flexible integration with various single-domain and cross-domain downstream policy learning methods. Despite its simplicity, xTED demonstrates superior performance in extensive simulation and real-robot experiments.

Haoyi Niu, Qimao Chen, Tenglong Liu, Jianxiong Li, Guyue Zhou, Yi Zhang, Jianming Hu, Xianyuan Zhan• 2024

Related benchmarks

TaskDatasetResultRank
LocomotionD4RL HalfCheetah Medium-Replay
Normalized Score0.33
61
Locomotionhalfcheetah medium v2
Average Normalized Score41.2
12
Locomotionhalfcheetah medium-expert v2
Average Normalized Score43.8
12
Locomotionwalker2d medium-replay v2
Average Normalized Score25.9
12
LocomotionWalker2d Medium-Expert v2
Average Normalized Score82.9
12
Locomotionwalker2d medium v2
Average Normalized Score58.9
12
Robot Task CompletionCup
Success Rate97
4
Robot Task CompletionCup w/ distractors
Success Rate73
4
Robot Task CompletionDuck
Success Rate63
4
Robot Task CompletionDuck w/ distractors
Success Rate83
4
Showing 10 of 12 rows

Other info

Follow for update