Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Bidirectional Decoding: Improving Action Chunking via Guided Test-Time Sampling

About

Predicting and executing a sequence of actions without intermediate replanning, known as action chunking, is increasingly used in robot learning from human demonstrations. Yet, its effects on the learned policy remain inconsistent: some studies find it crucial for achieving strong results, while others observe decreased performance. In this paper, we first dissect how action chunking impacts the divergence between a learner and a demonstrator. We find that action chunking allows the learner to better capture the temporal dependencies in demonstrations but at the cost of reduced reactivity to unexpected states. To address this tradeoff, we propose Bidirectional Decoding (BID), a test-time inference algorithm that bridges action chunking with closed-loop adaptation. At each timestep, BID samples multiple candidate predictions and searches for the optimal one based on two criteria: (i) backward coherence, which favors samples that align with previous decisions; (ii) forward contrast, which seeks samples of high likelihood for future plans. By coupling decisions within and across action chunks, BID promotes both long-term consistency and short-term reactivity. Experimental results show that our method boosts the performance of two state-of-the-art generative policies across seven simulation benchmarks and two real-world tasks. Code and videos are available at https://bid-robot.github.io.

Yuejiang Liu, Jubayer Ibn Hamid, Annie Xie, Yoonho Lee, Maximilian Du, Chelsea Finn• 2024

Related benchmarks

TaskDatasetResultRank
Robotic Task CompletionKinetix
Success Rate89.7
60
Robotic ManipulationMOVEBENCH Accelerated Motion - Accelerating Hard
Success Rate33
12
Robotic ManipulationMOVEBENCH Irregular Motion - Rand. Walk
Success Rate68
12
Robotic ManipulationMOVEBENCH Irregular Motion - Stop & Go
Success Rate51
12
Robotic ManipulationMOVEBENCH Average Dynamic Only
Success Rate54.6
12
Robotic ManipulationMOVEBENCH Average All
Success Rate57
12
Robotic ManipulationMOVEBENCH Uniform Motion - Moving Easy
Success Rate80
12
Robotic ManipulationMOVEBENCH Uniform Motion - Moving Med.
Success Rate75
12
Robotic ManipulationMOVEBENCH Accelerated Motion - Accelerating Med.
Success Rate50
12
Robotic ManipulationMOVEBENCH Uniform Motion - Moving Hard
Success Rate29
12
Showing 10 of 14 rows

Other info

Follow for update