BiKC+: Bimanual Hierarchical Imitation with Keypose-Conditioned Coordination-Aware Consistency Policies
About
Robots are essential in industrial manufacturing due to their reliability and efficiency. They excel in performing simple and repetitive unimanual tasks but still face challenges with bimanual manipulation. This difficulty arises from the complexities of coordinating dual arms and handling multi-stage processes. Recent integration of generative models into imitation learning (IL) has made progress in tackling specific challenges. However, few approaches explicitly consider the multi-stage nature of bimanual tasks while also emphasizing the importance of inference speed. In multi-stage tasks, failures or delays at any stage can cascade over time, impacting the success and efficiency of subsequent sub-stages and ultimately hindering overall task performance. In this paper, we propose a novel keypose-conditioned coordination-aware consistency policy tailored for bimanual manipulation. Our framework instantiates hierarchical imitation learning with a high-level keypose predictor and a low-level trajectory generator. The predicted keyposes serve as sub-goals for trajectory generation, indicating targets for individual sub-stages. The trajectory generator is formulated as a consistency model, generating action sequences based on historical observations and predicted keyposes in a single inference step. In particular, we devise an innovative approach for identifying bimanual keyposes, considering both robot-centric action features and task-centric operation styles. Simulation and real-world experiments illustrate that our approach significantly outperforms baseline methods in terms of success rates and operational efficiency. Implementation codes can be found at https://github.com/JoanaHXU/BiKC-plus.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Bimanual Insertion | ALOHA Simulation Insertion (Evaluation) | Overall Success Rate98 | 12 | |
| Bimanual Transfer | ALOHA Simulation Transfer (Evaluation) | Overall Success Rate98 | 12 | |
| Pants Hanging | Pants Hanging Real-world | Grasp Hanger Success Rate100 | 4 | |
| Placing and Picking on Conveyor | Placing and Picking on Conveyor Real-world | Put Success Rate100 | 4 | |
| Screwdriver Packing | Screwdriver Packing Real-world | Pick Success Rate85 | 4 |