Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

DexPBT: Scaling up Dexterous Manipulation for Hand-Arm Systems with Population Based Training

About

In this work, we propose algorithms and methods that enable learning dexterous object manipulation using simulated one- or two-armed robots equipped with multi-fingered hand end-effectors. Using a parallel GPU-accelerated physics simulator (Isaac Gym), we implement challenging tasks for these robots, including regrasping, grasp-and-throw, and object reorientation. To solve these problems we introduce a decentralized Population-Based Training (PBT) algorithm that allows us to massively amplify the exploration capabilities of deep reinforcement learning. We find that this method significantly outperforms regular end-to-end learning and is able to discover robust control policies in challenging tasks. Video demonstrations of learned behaviors and the code can be found at https://sites.google.com/view/dexpbt

Aleksei Petrenko, Arthur Allshire, Gavriel State, Ankur Handa, Viktor Makoviychuk• 2023

Related benchmarks

TaskDatasetResultRank
Dexterous ManipulationAllegroHand
Average Performance1.32e+4
4
Dexterous ManipulationTwo-Arms Reorientation
Average Performance26.43
4
Dexterous ManipulationRegrasping
Average Performance35.26
4
Dexterous ManipulationReorientation
Average Performance2.92
4
Dexterous ManipulationThrow
Average Performance19.08
4
Dexterous ManipulationShadowHand
Average Performance1.03e+4
4
Showing 6 of 6 rows

Other info

Follow for update