Bidirectional Decoding: Improving Action Chunking via Guided Test-Time Sampling

About

Predicting and executing a sequence of actions without intermediate replanning, known as action chunking, is increasingly used in robot learning from human demonstrations. Yet, its effects on the learned policy remain inconsistent: some studies find it crucial for achieving strong results, while others observe decreased performance. In this paper, we first dissect how action chunking impacts the divergence between a learner and a demonstrator. We find that action chunking allows the learner to better capture the temporal dependencies in demonstrations but at the cost of reduced reactivity to unexpected states. To address this tradeoff, we propose Bidirectional Decoding (BID), a test-time inference algorithm that bridges action chunking with closed-loop adaptation. At each timestep, BID samples multiple candidate predictions and searches for the optimal one based on two criteria: (i) backward coherence, which favors samples that align with previous decisions; (ii) forward contrast, which seeks samples of high likelihood for future plans. By coupling decisions within and across action chunks, BID promotes both long-term consistency and short-term reactivity. Experimental results show that our method boosts the performance of two state-of-the-art generative policies across seven simulation benchmarks and two real-world tasks. Code and videos are available at https://bid-robot.github.io.

Yuejiang Liu, Jubayer Ibn Hamid, Annie Xie, Yoonho Lee, Maximilian Du, Chelsea Finn• 2024

Related benchmarks

Task	Dataset	Result
Robotic Task Completion	Kinetix	Success Rate89.7	60
Robotic Manipulation	Push T	Success Rate73.3	40
Robotic Manipulation	Image PushT	Success Rate65	17
Robotic Manipulation	MOVEBENCH Accelerated Motion - Accelerating Hard	Success Rate33	12
Robotic Manipulation	MOVEBENCH Irregular Motion - Rand. Walk	Success Rate68	12
Robotic Manipulation	MOVEBENCH Irregular Motion - Stop & Go	Success Rate51	12
Robotic Manipulation	MOVEBENCH Average Dynamic Only	Success Rate54.6	12
Robotic Manipulation	MOVEBENCH Average All	Success Rate57	12
Robotic Manipulation	MOVEBENCH Uniform Motion - Moving Easy	Success Rate80	12
Robotic Manipulation	MOVEBENCH Uniform Motion - Moving Med.	Success Rate75	12

Showing 10 of 21 rows

Other info

Follow for update

@wizwand_team Discord