Learning Fine-Grained Bimanual Manipulation with Low-Cost Hardware
About
Fine manipulation tasks, such as threading cable ties or slotting a battery, are notoriously difficult for robots because they require precision, careful coordination of contact forces, and closed-loop visual feedback. Performing these tasks typically requires high-end robots, accurate sensors, or careful calibration, which can be expensive and difficult to set up. Can learning enable low-cost and imprecise hardware to perform these fine manipulation tasks? We present a low-cost system that performs end-to-end imitation learning directly from real demonstrations, collected with a custom teleoperation interface. Imitation learning, however, presents its own challenges, particularly in high-precision domains: errors in the policy can compound over time, and human demonstrations can be non-stationary. To address these challenges, we develop a simple yet novel algorithm, Action Chunking with Transformers (ACT), which learns a generative model over action sequences. ACT allows the robot to learn 6 difficult tasks in the real world, such as opening a translucent condiment cup and slotting a battery with 80-90% success, with only 10 minutes worth of demonstrations. Project website: https://tonyzhaozh.github.io/aloha/
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Robotic Manipulation | LIBERO | Spatial Success Rate8 | 314 | |
| Robotic Manipulation | RoboTwin 2.0 | Average Success Rate50.8 | 64 | |
| Robot Manipulation | Adroit | Pen Task Score47 | 50 | |
| Robotic Manipulation | RLBench (test) | Average Success Rate39.6 | 49 | |
| Coffee Making/Handling | Robomimic MimicGen Coffee (D2) | Success Rate20 | 25 | |
| Coffee Preparation | Robomimic/MimicGen Coffee Prep. (D1) | Success Rate36 | 20 | |
| Bimanual Manipulation | RLBench 2 | Push Box Success Rate48.7 | 20 | |
| Mug Cleanup | Robomimic MimicGen Mug Cleanup (D1) | Success Rate26 | 20 | |
| Episodic memory-intensive manipulation | Camo-Dataset | MSR83.3 | 18 | |
| Robotic Manipulation | Push T | Success Rate81 | 16 |