Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Real-Time Execution of Action Chunking Flow Policies

About

Modern AI systems, especially those interacting with the physical world, increasingly require real-time performance. However, the high latency of state-of-the-art generalist models, including recent vision-language action models (VLAs), poses a significant challenge. While action chunking has enabled temporal consistency in high-frequency control tasks, it does not fully address the latency problem, leading to pauses or out-of-distribution jerky movements at chunk boundaries. This paper presents a novel inference-time algorithm that enables smooth asynchronous execution of action chunking policies. Our method, real-time chunking (RTC), is applicable to any diffusion- or flow-based VLA out of the box with no re-training. It generates the next action chunk while executing the current one, "freezing" actions guaranteed to execute and "inpainting" the rest. To test RTC, we introduce a new benchmark of 12 highly dynamic tasks in the Kinetix simulator, as well as evaluate 6 challenging real-world bimanual manipulation tasks. Results demonstrate that RTC is fast, performant, and uniquely robust to inference delay, significantly improving task throughput and enabling high success rates in precise tasks $\unicode{x2013}$ such as lighting a match $\unicode{x2013}$ even in the presence of significant latency. See https://pi.website/research/real_time_chunking for videos.

Kevin Black, Manuel Y. Galliker, Sergey Levine• 2025

Related benchmarks

TaskDatasetResultRank
Robotic ManipulationLIBERO Spatial Object Goal Long
Overall Success Rate (Long)25.2
82
Robotic Task CompletionKinetix
Success Rate89.2
60
Robotic ManipulationPeel Cucumber
Success Rate74
8
Robotic ManipulationWipe Vase
Success Rate100
8
Robotic ManipulationWrite Board
Success Rate100
8
Reaction Performance AnalysisReaction Capability Evaluation RTX 4090
Time To First Action (ms)80
6
Reaction Performance AnalysisReaction Capability Evaluation RTX 4060
Time To First Activation (TTFA)303.3
6
Delay-robust robot controlKinetix
Success Rate (Avg d=0-7)48.9
6
Simulated Robot Action ChunkingKinetix (full-data)
Overall Return77.7
6
Compositional Navigate-then-GraspNavigate-then-Grasp No Synthetic Trials
Gate Success80
5
Showing 10 of 27 rows

Other info

Follow for update