Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Physics-Based Motion Imitation with Adversarial Differential Discriminators

About

Multi-objective optimization problems, which require the simultaneous optimization of multiple objectives, are prevalent across numerous applications. Existing multi-objective optimization methods often rely on manually-tuned aggregation functions to formulate a joint optimization objective. The performance of such hand-tuned methods is heavily dependent on careful weight selection, a time-consuming and laborious process. These limitations also arise in the setting of reinforcement-learning-based motion tracking methods for physically simulated characters, where intricately crafted reward functions are typically used to achieve high-fidelity results. Such solutions not only require domain expertise and significant manual tuning, but also limit the applicability of the resulting reward function across diverse skills. To bridge this gap, we present a novel adversarial multi-objective optimization technique that is broadly applicable to a range of multi-objective reinforcement-learning tasks, including motion tracking. Our proposed Adversarial Differential Discriminator (ADD) receives a single positive sample, yet is still effective at guiding the optimization process. We demonstrate that our technique can enable characters to closely replicate a variety of acrobatic and agile behaviors, achieving comparable quality to state-of-the-art motion-tracking methods, without relying on manually-designed reward functions. Code and results are available at https://add-moo.github.io/.

Ziyu Zhang, Sergey Bashkirov, Dun Yang, Yi Shi, Michael Taylor, Xue Bin Peng• 2025

Related benchmarks

TaskDatasetResultRank
Single-trajectory motion-trackingWalk skill trajectory
Successful Samples Count (SR >= 80%)7.00e+7
2
Single-trajectory motion-trackingRun skill trajectory
Samples SR >= 80%113.7
2
Single-trajectory motion-trackingJump skill trajectory
Successful Samples Count (x1M)2.23e+8
2
Single-trajectory motion-trackingSpinkick skill trajectory
Sample Count (M)135.6
2
Single-trajectory motion-trackingCartwheel skill trajectory
Samples Count (SR >= 80%) [M]183.6
2
Single-trajectory motion-trackingSideflip skill trajectory
Samples (SR >= 80%)336.5
2
Single-trajectory motion-trackingSpeed Vault skill trajectory
Samples (SR >= 80%)389
2
Showing 7 of 7 rows

Other info

Follow for update