MP1: MeanFlow Tames Policy Learning in 1-step for Robotic Manipulation

About

In robot manipulation, robot learning has become a prevailing approach. However, generative models within this field face a fundamental trade-off between the slow, iterative sampling of diffusion models and the architectural constraints of faster Flow-based methods, which often rely on explicit consistency losses. To address these limitations, we introduce MP1, which pairs 3D point-cloud inputs with the MeanFlow paradigm to generate action trajectories in one network function evaluation (1-NFE). By directly learning the interval-averaged velocity via the "MeanFlow Identity", our policy avoids any additional consistency constraints. This formulation eliminates numerical ODE-solver errors during inference, yielding more precise trajectories. MP1 further incorporates CFG for improved trajectory controllability while retaining 1-NFE inference without reintroducing structural constraints. Because subtle scene-context variations are critical for robot learning, especially in few-shot learning, we introduce a lightweight Dispersive Loss that repels state embeddings during training, boosting generalization without slowing inference. We validate our method on the Adroit and Meta-World benchmarks, as well as in real-world scenarios. Experimental results show MP1 achieves superior average task success rates, outperforming DP3 by 10.2% and FlowPolicy by 7.3%. Its average inference time is only 6.8 ms-19x faster than DP3 and nearly 2x faster than FlowPolicy. Our project page is available at https://mp1-2254.github.io/, and the code can be accessed at https://github.com/LogSSim/MP1.

Juyi Sheng, Ziyi Wang, Peiming Li, Mengyuan Liu• 2025

Related benchmarks

Task	Dataset	Result
Robot Manipulation	Adroit	Pen Task Score58	50
3D pointcloud manipulation	MetaWorld	Success Rate (Easy)91.3	30
Robotic Manipulation	DexArt	Success Rate (Bucket)33	29
Robotic Manipulation	Adroit and MetaWorld	Average Success Rate78.9	28
Robotic Manipulation	Meta-World	Success Rate (Easy)85.8	27
Robot Manipulation	MetaWorld, Adroit, and Dexart Combined	Average Success Rate79.7	25
Robot Manipulation	MetaWorld 50 tasks	Success Rate (Easy)88.2	21
Dexterous Manipulation	Adroit	Hammer Success100	17
Robot Policy Inference Efficiency	NVIDIA RTX 4090 simulation (inference)	Inference Time (ms)4.1	12
Robot Policy Inference Efficiency	NVIDIA RTX 2080 physical robot deployment (inference)	Inference Time (ms)21.4	12

Showing 10 of 44 rows

Other info

Follow for update

@wizwand_team Discord