Physics-Based Motion Imitation with Adversarial Differential Discriminators

About

Multi-objective optimization problems, which require the simultaneous optimization of multiple objectives, are prevalent across numerous applications. Existing multi-objective optimization methods often rely on manually-tuned aggregation functions to formulate a joint optimization objective. The performance of such hand-tuned methods is heavily dependent on careful weight selection, a time-consuming and laborious process. These limitations also arise in the setting of reinforcement-learning-based motion tracking methods for physically simulated characters, where intricately crafted reward functions are typically used to achieve high-fidelity results. Such solutions not only require domain expertise and significant manual tuning, but also limit the applicability of the resulting reward function across diverse skills. To bridge this gap, we present a novel adversarial multi-objective optimization technique that is broadly applicable to a range of multi-objective reinforcement-learning tasks, including motion tracking. Our proposed Adversarial Differential Discriminator (ADD) receives a single positive sample, yet is still effective at guiding the optimization process. We demonstrate that our technique can enable characters to closely replicate a variety of acrobatic and agile behaviors, achieving comparable quality to state-of-the-art motion-tracking methods, without relying on manually-designed reward functions. Code and results are available at https://add-moo.github.io/.
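The contrast drawn above, between a hand-weighted aggregate reward and a discriminator that sees only one positive sample, can be illustrated with a small sketch. Everything in it is an assumption made for illustration and is not taken from the authors' code: the function and class names are hypothetical, the discriminator is a toy logistic model over squared differences rather than a neural network, the single positive sample is assumed to be the all-zero difference vector (a perfect match on every objective), and the reward is assumed to take the GAN-style form -log(1 - D(delta)).

```python
import math
import random


def weighted_sum_reward(errors, weights, scales):
    """Hand-tuned aggregation: r = sum_i w_i * exp(-k_i * e_i).

    Every w_i and k_i has to be chosen manually for each skill.
    """
    return sum(w * math.exp(-k * e) for e, w, k in zip(errors, weights, scales))


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


class ToyDifferentialDiscriminator:
    """Toy logistic discriminator over per-objective differences (hypothetical)."""

    def __init__(self, dim):
        self.w = [0.0] * dim
        self.b = 0.0

    def score(self, delta):
        # Squared differences are used as features so only magnitude matters.
        z = self.b + sum(w * d * d for w, d in zip(self.w, delta))
        return sigmoid(z)

    def update(self, negatives, lr=0.1):
        """One gradient step on binary cross-entropy.

        The only positive sample is the zero vector (label 1);
        the observed differences are the negatives (label 0).
        """
        dim = len(self.w)
        grad_w = [0.0] * dim
        # Positive sample: its features are all zero, so only the bias moves.
        grad_b = self.score([0.0] * dim) - 1.0
        for delta in negatives:
            p = self.score(delta)
            grad_b += p / len(negatives)
            for i, d in enumerate(delta):
                grad_w[i] += p * d * d / len(negatives)
        self.b -= lr * grad_b
        self.w = [w - lr * g for w, g in zip(self.w, grad_w)]

    def reward(self, delta):
        # Assumed GAN-style reward; clamped to avoid log(0).
        return -math.log(max(1.0 - self.score(delta), 1e-8))


def train_demo(dim=3, steps=200, batch=16, seed=0):
    """Train the toy discriminator on random stand-in tracking errors."""
    rng = random.Random(seed)
    disc = ToyDifferentialDiscriminator(dim)
    for _ in range(steps):
        negatives = [[rng.uniform(-1.0, 1.0) for _ in range(dim)]
                     for _ in range(batch)]
        disc.update(negatives)
    return disc


if __name__ == "__main__":
    disc = train_demo()
    print("reward, small error:", disc.reward([0.1, 0.1, 0.1]))
    print("reward, large error:", disc.reward([1.0, 1.0, 1.0]))
```

In this sketch the per-objective weights are learned by the discriminator instead of being set by hand, so smaller differences earn a higher reward without any manual weight selection. In an actual training loop the negatives would come from the policy's own tracking errors rather than from a random generator.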

Ziyu Zhang, Sergey Bashkirov, Dun Yang, Yi Shi, Michael Taylor, Xue Bin Peng • 2025

Related benchmarks

Task	Dataset	Metric	Value
Single-trajectory motion-tracking	Walk skill trajectory	Successful Samples Count (SR >= 80%)	7.00e+7	2
Single-trajectory motion-tracking	Run skill trajectory	Samples SR >= 80%	113.7	2
Single-trajectory motion-tracking	Jump skill trajectory	Successful Samples Count (x1M)	2.23e+8	2
Single-trajectory motion-tracking	Spinkick skill trajectory	Sample Count (M)	135.6	2
Single-trajectory motion-tracking	Cartwheel skill trajectory	Samples Count (SR >= 80%) [M]	183.6	2
Single-trajectory motion-tracking	Sideflip skill trajectory	Samples (SR >= 80%)	336.5	2
Single-trajectory motion-tracking	Speed Vault skill trajectory	Samples (SR >= 80%)	389	2
