ElasticFlow: One-Step Physics-Consistent Policy with Elastic Time Horizons for Language-Guided Manipulation

About

Diffusion policies have demonstrated exceptional performance in embodied AI. However, their iterative denoising process results in high latency, and existing acceleration methods often sacrifice physical consistency. To address this, we propose ElasticFlow, a distillation-free, physics-consistent one-step policy framework. We reconstruct the Mean Field Theory by directly modeling the average velocity field, enabling a direct single-step mapping from noise to action. Addressing the Temporal Heterogeneity of robotic tasks, we introduce the Elastic Time Horizons mechanism. This mechanism effectively overcomes Spectral Bias by explicitly encoding control granularity, achieving efficient alignment between semantic instructions and physical execution horizons. Experiments on benchmarks such as LIBERO, CALVIN, and RoboTwin demonstrate that ElasticFlow achieves efficient 1-NFE inference (approximately 71Hz). Furthermore, it outperforms state-of-the-art methods, including OpenVLA and $\pi_0$, on long-horizon tasks, highlighting its potential for efficient, robust, and semantically aligned control.

Kewei Chen, Yayu Long, Shuai Li, Mingsheng Shang• 2026

Related benchmarks

Task	Dataset	Result
Robotic Manipulation	LIBERO v1 (test)	Average Success Rate98.5	118
Robotic Manipulation	Calvin ABC->D	Task-1 Score97.8	71
Robot Manipulation	LIBERO LONG (test)	Success Rate97.6	27
Dual-arm manipulation	RoboTwin Short Horizon Tasks 100-130 Steps 2.0	Lift Pot Success Rate67.4	20
Dual-arm manipulation	RoboTwin Medium Horizon Tasks 150-230 Steps 2.0	Move Can Pot63.4	20
Dual-arm manipulation	RoboTwin Long & Extra Long Horizon Tasks 280-650 Steps 2.0	Handover Block62.9	20
Inference Latency and Control Frequency Analysis	NVIDIA RTX 4090	Latency (ms)14	5

Showing 7 of 7 rows

Other info

Follow for update

@wizwand_team Discord