Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Augmenting Reinforcement Learning with Behavior Primitives for Diverse Manipulation Tasks

About

Realistic manipulation tasks require a robot to interact with an environment with a prolonged sequence of motor actions. While deep reinforcement learning methods have recently emerged as a promising paradigm for automating manipulation behaviors, they usually fall short in long-horizon tasks due to the exploration burden. This work introduces Manipulation Primitive-augmented reinforcement Learning (MAPLE), a learning framework that augments standard reinforcement learning algorithms with a pre-defined library of behavior primitives. These behavior primitives are robust functional modules specialized in achieving manipulation goals, such as grasping and pushing. To use these heterogeneous primitives, we develop a hierarchical policy that involves the primitives and instantiates their executions with input parameters. We demonstrate that MAPLE outperforms baseline approaches by a significant margin on a suite of simulated manipulation tasks. We also quantify the compositional structure of the learned behaviors and highlight our method's ability to transfer policies to new task variants and to physical hardware. Videos and code are available at https://ut-austin-rpl.github.io/maple

Soroush Nasiriany, Huihan Liu, Yuke Zhu• 2021

Related benchmarks

TaskDatasetResultRank
Block-stackingRoboSuite online finetuning
Mean Success Rate72.3
7
Door OpeningRoboSuite online finetuning
Mean Success Rate55.5
7
Nut AssemblyRoboSuite online finetuning
Mean Success Rate42.1
7
Pick-&-PlaceRoboSuite online finetuning
Mean Success Rate65.6
7
High-Shelf PlacementElephant Robotics 280 Real-world (test)
Success Rate66.5
7
StackingElephant Robotics 280 Real-world (test)
Success Rate68.7
7
Target GraspingElephant Robotics 280 Real-world (test)
Success Rate66
7
Robotic ManipulationObstacle 2D
Success Rate100
6
Robotic ManipulationObstacle Tower
Success Rate0.00e+0
6
Robotic ManipulationCluttered Drawer
Success Rate0.00e+0
6
Showing 10 of 11 rows

Other info

Follow for update