HiPolicy: Hierarchical Multi-Frequency Action Chunking for Policy Learning
About
Robotic imitation learning faces a fundamental trade-off between modeling long-horizon dependencies and enabling fine-grained closed-loop control. Existing fixed-frequency action chunking approaches struggle to achieve both. Building on this insight, we propose HiPolicy, a hierarchical multi-frequency action chunking framework that jointly predicts action sequences at different frequencies to capture both coarse high-level plans and precise reactive motions. We extract and fuse hierarchical features from history observations aligned to each frequency for multi-frequency chunk generation, and introduce an entropy-guided execution mechanism that adaptively balances long-horizon planning with fine-grained control based on action uncertainty. Experiments on diverse simulated benchmarks and real-world manipulation tasks show that HiPolicy can be seamlessly integrated into existing 2D and 3D generative policies, delivering consistent improvements in performance while significantly enhancing execution efficiency.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Robotic Manipulation | RoboTwin 2.0 | Average Success Rate88 | 64 | |
| Robotic Manipulation | RoboTwin 1.0 | Success Rate100 | 48 | |
| Close microwave door | Real-world Robot Tasks 1.0 (test) | Success Rate100 | 2 | |
| Open microwave door | Real-world Robot Tasks 1.0 (test) | Success Rate90 | 2 | |
| Pack the package | Real-world Robot Tasks 1.0 (test) | Success Rate90 | 2 | |
| Place Bowl | Real-world Robot Tasks 1.0 (test) | Success Rate90 | 2 | |
| Press toaster button | Real-world Robot Tasks 1.0 (test) | Success Rate80 | 2 | |
| Stack Cube | Real-world Robot Tasks 1.0 (test) | Success Rate80 | 2 | |
| Store vegetables | Real-world Robot Tasks 1.0 (test) | Success Rate90 | 2 | |
| Sweep board | Real-world Robot Tasks 1.0 (test) | Success Rate60 | 2 |