Learning to Walk in the Real World with Minimal Human Effort
About
Reliable and stable locomotion has been one of the most fundamental challenges for legged robots. Deep reinforcement learning (deep RL) has emerged as a promising method for developing such control policies autonomously. In this paper, we develop a system for learning legged locomotion policies with deep RL in the real world with minimal human effort. The key difficulties for on-robot learning systems are automatic data collection and safety. We overcome these two challenges by developing a multi-task learning procedure and a safety-constrained RL framework. We tested our system on the task of learning to walk on three different terrains: flat ground, a soft mattress, and a doormat with crevices. Our system can automatically and efficiently learn locomotion skills on a Minitaur robot with little human intervention. The supplemental video can be found at: \url{https://youtu.be/cwyiq6dCgOc}.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Robot navigation | Robot Navigation Task | Success Rate64 | 7 | |
| Constrained Multi-Objective Reinforcement Learning | Building thermal control environment | Cost Sum171.4 | 6 | |
| Constrained Multi-Objective Reinforcement Learning | MoAnt v5 | Cost Sum36.8 | 6 | |
| Multi-objective Locomotion Control | MoAnt v5 | Total Cost47.8 | 6 |