BiNoMaP: Learning Category-Level Bimanual Non-Prehensile Manipulation Primitives
About
Non-prehensile manipulation, encompassing ungraspable actions such as pushing, poking, pivoting, and wrapping, remains underexplored due to its contact-rich and analytically intractable nature. We revisit this problem from two perspectives. First, instead of relying on single-arm setups or favorable environmental supports (e.g., walls or edges), we advocate a generalizable dual-arm configuration and establish a suite of Bimanual Non-prehensile Manipulation Primitives (BiNoMaP). Second, departing from prevailing RL-based approaches, we propose a three-stage, RL-free framework for learning structured non-prehensile skills. We begin by extracting bimanual hand motion trajectories from video demonstrations. Since these coarse trajectories suffer from perceptual noise and morphological discrepancies, we introduce a geometry-aware post-optimization algorithm to refine them into executable manipulation primitives consistent with predefined motion patterns. To enable category-level generalization, the learned primitives are further parameterized by object-relevant geometric attributes, primarily size, allowing adaptation to unseen instances with significant shape variations. Importantly, BiNoMaP supports cross-embodiment transfer: the same primitives can be deployed on two real-world dual-arm platforms with distinct kinematic configurations, without redesigning skill structures. Extensive real-robot experiments across diverse objects and spatial configurations demonstrate the effectiveness, efficiency, and strong generalization capability of our approach.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Bimanual Non-Prehensile Manipulation | BiNoMaP Bimanual Manipulation Suite | Poking (L/R) Plastic Cup Success Rate0.9667 | 7 | |
| Non-prehensile manipulation | Real-world Manipulation (Category-Level split) | Average Success Rate76.2 | 5 | |
| pivoting (LR) | Real-world Manipulation (Category-Level split) | Success Rate (Bowls)0.6571 | 5 | |
| poking (L/R) | Real-world Manipulation Category-Level | Success Rate (Cups)0.85 | 5 | |
| wrapping (LR) | Real-world Manipulation Category-Level | Success Rate (2 balls)85 | 5 | |
| Non-prehensile manipulation | Real-world Manipulation Instance-Level | Success Rate86.7 | 1 | |
| pivoting (LR) | Real-world Manipulation Instance-Level | Success Rate (Plastic Bowl)20 | 1 | |
| poking (L/R) | Real-world Manipulation (Instance-Level split) | Plastic Cup Success Rate29 | 1 | |
| pushing (LR) | Real-world Manipulation Instance-Level | Success Rate (Heavy Basket)28 | 1 | |
| pushing (LR) | Real-world Manipulation (Category-Level split) | Success Rate (Other 5 Baskets)43 | 1 |