Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Drift is a Sampling Error: SNR-Aware Power Distributions for Long-Horizon Robotic Planning

About

Despite rapid progress in Vision-Language-Action (VLA) models for robotic control, instruction drift remains a persistent failure mode in long-horizon tasks. This paper reconceptualizes this phenomenon, positing that instruction drift is fundamentally a systematic sampling error: local greedy sampling is prone to collapsing into "Negative Pivotal Windows"--irreversible local optima with high local probability that sever global success pathways. To address this, we propose Context-Aware Power Sampling (CAPS), a training-free inference-time computation framework. CAPS leverages power distributions to sharpen global trajectory probabilities, enabling lookahead search over the model's conditional generative trajectory distribution. Furthermore, we introduce a metacognitive control mechanism based on Signal-to-Noise Ratio (SNR). This mechanism triggers adaptive MCMC search solely when drift risk is detected, enabling a dynamic transition from "intuitive fast thinking" to "rational slow search." Experiments on RoboTwin, Simpler-WindowX, and Libero-long benchmarks show that CAPS achieves substantial improvements over strong baselines, including OpenVLA and TACO, without parameter updates. These results support the effectiveness of adaptive inference-time computation for improving long-horizon robustness in embodied control.

Kewei Chen, Yayu Long, Mingsheng Shang• 2026

Related benchmarks

TaskDatasetResultRank
Long-horizon robotic instruction followingLIBERO Long
Soup and Sauce in Basket Success Rate100
5
Robot ManipulationRoboTwin 2.0
Adjust Bottle Success Rate95
4
Bimanual Robot ManipulationRoboTwin 1.0
Block Handover67
3
Robot ManipulationReal-World Benchmark 10 Tasks
Sequential Success Rate75
3
Showing 4 of 4 rows

Other info

Follow for update