Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Learning to Manipulate Anywhere: A Visual Generalizable Framework For Reinforcement Learning

About

Can we endow visuomotor robots with generalization capabilities to operate in diverse open-world scenarios? In this paper, we propose \textbf{Maniwhere}, a generalizable framework tailored for visual reinforcement learning, enabling the trained robot policies to generalize across a combination of multiple visual disturbance types. Specifically, we introduce a multi-view representation learning approach fused with Spatial Transformer Network (STN) module to capture shared semantic information and correspondences among different viewpoints. In addition, we employ a curriculum-based randomization and augmentation approach to stabilize the RL training process and strengthen the visual generalization ability. To exhibit the effectiveness of Maniwhere, we meticulously design 8 tasks encompassing articulate objects, bi-manual, and dexterous hand manipulation tasks, demonstrating Maniwhere's strong visual generalization and sim2real transfer abilities across 3 hardware platforms. Our experiments show that Maniwhere significantly outperforms existing state-of-the-art methods. Videos are provided at https://gemcollector.github.io/maniwhere/.

Zhecheng Yuan, Tianming Wei, Shuiqi Cheng, Gu Zhang, Yuanpei Chen, Huazhe Xu• 2024

Related benchmarks

TaskDatasetResultRank
Cube LiftAIRBOT Play CubeLift
Success Rate86.4
11
Button pressManiwhere-inspired Benchmark AIRBOT Play
Success Rate96.4
8
Button Press DexManiwhere-inspired Benchmark UR5
Success Rate96
8
Drawer-OpenManiwhere-inspired Benchmark UR5
Success Rate76.4
8
Hand Over DualManiwhere-inspired Benchmark Franka
Success Rate93.2
8
Laptop CloseManiwhere-inspired Benchmark AIRBOT Play
Success Rate86.4
8
Pick & Place DexManiwhere-inspired Benchmark Franka
Success Rate76
8
Pick-&-PlaceManiwhere-inspired Benchmark AIRBOT Play
Success Rate88.4
8
ReachManiwhere-inspired Benchmark AIRBOT Play
Success Rate97.2
8
Reach DexManiwhere-inspired Benchmark UR5
Success Rate96.8
8
Showing 10 of 11 rows

Other info

Follow for update