HiPolicy: Hierarchical Multi-Frequency Action Chunking for Policy Learning

About

Robotic imitation learning faces a fundamental trade-off between modeling long-horizon dependencies and enabling fine-grained closed-loop control. Existing fixed-frequency action chunking approaches struggle to achieve both. Building on this insight, we propose HiPolicy, a hierarchical multi-frequency action chunking framework that jointly predicts action sequences at different frequencies to capture both coarse high-level plans and precise reactive motions. We extract and fuse hierarchical features from history observations aligned to each frequency for multi-frequency chunk generation, and introduce an entropy-guided execution mechanism that adaptively balances long-horizon planning with fine-grained control based on action uncertainty. Experiments on diverse simulated benchmarks and real-world manipulation tasks show that HiPolicy can be seamlessly integrated into existing 2D and 3D generative policies, delivering consistent improvements in performance while significantly enhancing execution efficiency.

Jiyao Zhang, Zimu Han, Junhan Wang, Xionghao Wu, Shihong Lin, Jinzhou Li, Hongwei Fan, Ruihai Wu, Dongjiang Li, Hao Dong• 2026

Related benchmarks

Task	Dataset	Result
Robotic Manipulation	RoboTwin 2.0	Average Success Rate88	115
Robotic Manipulation	RoboTwin 1.0	Success Rate100	48
Close microwave door	Real-world Robot Tasks 1.0 (test)	Success Rate100	2
Open microwave door	Real-world Robot Tasks 1.0 (test)	Success Rate90	2
Pack the package	Real-world Robot Tasks 1.0 (test)	Success Rate90	2
Place Bowl	Real-world Robot Tasks 1.0 (test)	Success Rate90	2
Press toaster button	Real-world Robot Tasks 1.0 (test)	Success Rate80	2
Stack Cube	Real-world Robot Tasks 1.0 (test)	Success Rate80	2
Store vegetables	Real-world Robot Tasks 1.0 (test)	Success Rate90	2
Sweep board	Real-world Robot Tasks 1.0 (test)	Success Rate60	2

Showing 10 of 10 rows

Other info

Follow for update

@wizwand_team Discord