Sim-to-Real of Humanoid Locomotion Policies via Joint Torque Space Perturbation Injection
About
This paper proposes a novel alternative to existing sim-to-real methods for training control policies with simulated experiences. Prior sim-to-real methods for legged robots mostly rely on the domain randomization approach, where a fixed finite set of simulation parameters is randomized during training. Instead, our method adds state-dependent perturbations to the input joint torque used for forward simulation during the training phase. These state-dependent perturbations are designed to simulate a broader range of reality gaps than those captured by randomizing a fixed set of simulation parameters. Experimental results show that our method enables humanoid locomotion policies that achieve greater robustness against complex reality gaps unseen in the training domain.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Humanoid Locomotion | IsaacGym S1: Standard DR Sim-to-real gap 2048 envs (test) | Success Rate99.7 | 3 | |
| Humanoid Locomotion | IsaacGym S2: Wider DR 2048 envs Sim-to-real gap (test) | Success Rate98.4 | 3 |