TwinVLA: Data-Efficient Bimanual Manipulation with Twin Single-Arm Vision-Language-Action Models
About
Vision-language-action models (VLAs) trained on large-scale robotic datasets have demonstrated strong performance on manipulation tasks, including bimanual tasks. However, because most public datasets focus on single-arm demonstrations, adapting VLAs for bimanual tasks typically requires substantial additional bimanual data and fine-tuning. To address this challenge, we introduce TwinVLA, a modular framework that composes two copies of a pretrained single-arm VLA into a coordinated bimanual VLA. Unlike monolithic cross-embodiment models trained on mixtures of single-arm and bimanual data, TwinVLA improves both data efficiency and performance by composing pretrained single-arm policies. Across diverse bimanual tasks in real-world and simulation settings, TwinVLA outperforms a comparably-sized monolithic RDT-1B model without requiring any bimanual pretraining. Furthermore, it narrows the gap to state-of-the-art model $\pi_0$, which relies on extensive proprietary bimanual data and compute cost. These results establish our modular composition approach as a data-efficient and scalable path toward high-performance bimanual manipulation, leveraging public single-arm data.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Robot Manipulation | LIBERO | Goal Achievement93.5 | 494 | |
| Brush to dustpan | Real-World (test) | Success Rate: Move brush100 | 4 | |
| Carrot to bag | Real-World (test) | Pick Success Rate100 | 4 | |
| Dish drainer manipulation | Tabletop-Sim Easy | Success Rate95.4 | 4 | |
| Dish drainer manipulation | Tabletop-Sim Hard split | Success Rate0.836 | 4 | |
| Fold Towel | Real-World (test) | First Fold Success Rate1 | 4 | |
| Handover box manipulation | Tabletop-Sim Hard split | Success Rate53 | 4 | |
| Shoes table manipulation | Tabletop-Sim Easy | Success Rate84.8 | 4 | |
| Extract hexkey | Real-World (test) | Pick up Success Rate90 | 4 | |
| Handover box manipulation | Tabletop-Sim Easy | Success Rate78 | 4 |