Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Bi-HIL: Bilateral Control-Based Multimodal Hierarchical Imitation Learning via Subtask-Level Progress Rate and Keyframe Memory for Long-Horizon Contact-Rich Robotic Manipulation

About

Long-horizon contact-rich robotic manipulation remains challenging due to partial observability and unstable subtask transitions under contact uncertainty. While hierarchical architectures improve temporal reasoning and bilateral imitation learning enables force-aware control, existing approaches often rely on flat policies that struggle with long-horizon coordination. We propose Bi-HIL, a bilateral control-based multimodal hierarchical imitation learning framework for long-horizon manipulation. Bi-HIL stabilizes hierarchical coordination by integrating keyframe memory with subtask-level progress rate that models phase progression within the active subtask and conditions both high- and low-level policies. We evaluate Bi-HIL on unimanual and bimanual real-robot tasks, demonstrating consistent improvements over flat and ablated variants. The results highlight the importance of explicitly modeling subtask progression together with force-aware control for robust long-horizon manipulation. For additional material, please check: https://mertcookimg.github.io/bi-hil

Thanpimon Buamanee, Masato Kobayashi, Yuki Uranishi• 2026

Related benchmarks

TaskDatasetResultRank
Put-Three-Balls-in-DrawerPut-Three-Balls-in-Drawer (Left)
Success Rate #1100
5
Put-Three-Balls-in-DrawerPut-Three-Balls-in-Drawer Right
Step 1 Success Rate100
5
Bimanual Manipulation6-Cup Downstack
Sub-metric #1 Success Rate100
3
Bimanual Robotic Manipulation4-Peg-in-Hole
Success Rate Subtask 1100
3
Showing 4 of 4 rows

Other info

Follow for update