Learning to Manipulate Anywhere: A Visual Generalizable Framework For Reinforcement Learning
About
Can we endow visuomotor robots with generalization capabilities to operate in diverse open-world scenarios? In this paper, we propose \textbf{Maniwhere}, a generalizable framework tailored for visual reinforcement learning, enabling the trained robot policies to generalize across a combination of multiple visual disturbance types. Specifically, we introduce a multi-view representation learning approach fused with Spatial Transformer Network (STN) module to capture shared semantic information and correspondences among different viewpoints. In addition, we employ a curriculum-based randomization and augmentation approach to stabilize the RL training process and strengthen the visual generalization ability. To exhibit the effectiveness of Maniwhere, we meticulously design 8 tasks encompassing articulate objects, bi-manual, and dexterous hand manipulation tasks, demonstrating Maniwhere's strong visual generalization and sim2real transfer abilities across 3 hardware platforms. Our experiments show that Maniwhere significantly outperforms existing state-of-the-art methods. Videos are provided at https://gemcollector.github.io/maniwhere/.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Cube Lift | AIRBOT Play CubeLift | Success Rate86.4 | 11 | |
| Button press | Maniwhere-inspired Benchmark AIRBOT Play | Success Rate96.4 | 8 | |
| Button Press Dex | Maniwhere-inspired Benchmark UR5 | Success Rate96 | 8 | |
| Drawer-Open | Maniwhere-inspired Benchmark UR5 | Success Rate76.4 | 8 | |
| Hand Over Dual | Maniwhere-inspired Benchmark Franka | Success Rate93.2 | 8 | |
| Laptop Close | Maniwhere-inspired Benchmark AIRBOT Play | Success Rate86.4 | 8 | |
| Pick & Place Dex | Maniwhere-inspired Benchmark Franka | Success Rate76 | 8 | |
| Pick-&-Place | Maniwhere-inspired Benchmark AIRBOT Play | Success Rate88.4 | 8 | |
| Reach | Maniwhere-inspired Benchmark AIRBOT Play | Success Rate97.2 | 8 | |
| Reach Dex | Maniwhere-inspired Benchmark UR5 | Success Rate96.8 | 8 |