Real-Time Execution of Action Chunking Flow Policies
About
Modern AI systems, especially those interacting with the physical world, increasingly require real-time performance. However, the high latency of state-of-the-art generalist models, including recent vision-language action models (VLAs), poses a significant challenge. While action chunking has enabled temporal consistency in high-frequency control tasks, it does not fully address the latency problem, leading to pauses or out-of-distribution jerky movements at chunk boundaries. This paper presents a novel inference-time algorithm that enables smooth asynchronous execution of action chunking policies. Our method, real-time chunking (RTC), is applicable to any diffusion- or flow-based VLA out of the box with no re-training. It generates the next action chunk while executing the current one, "freezing" actions guaranteed to execute and "inpainting" the rest. To test RTC, we introduce a new benchmark of 12 highly dynamic tasks in the Kinetix simulator, as well as evaluate 6 challenging real-world bimanual manipulation tasks. Results demonstrate that RTC is fast, performant, and uniquely robust to inference delay, significantly improving task throughput and enabling high success rates in precise tasks $\unicode{x2013}$ such as lighting a match $\unicode{x2013}$ even in the presence of significant latency. See https://pi.website/research/real_time_chunking for videos.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Grasp-and-place | Grasp Hard | Average Completion Progress75.3 | 5 | |
| Grasp-and-place | Grasp Medium | Avg Completion Progress84.8 | 5 | |
| Grasp-and-place | Grasp-Easy | Average Completion Progress82.3 | 5 | |
| Bowls manipulation | Robot manipulation (real-world) | Task Score8.68 | 2 | |
| Drawer Manipulation | Real-world robot manipulation | Task Score9.2 | 2 | |
| PickPlace manipulation | Robot manipulation (real-world) | Task Score9.47 | 2 | |
| Pour manipulation | Robot manipulation (real-world) | Task Score9.34 | 2 | |
| Towel manipulation | Robot manipulation (real-world) | Task Score7.33 | 2 |