Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

MP1: MeanFlow Tames Policy Learning in 1-step for Robotic Manipulation

About

In robot manipulation, robot learning has become a prevailing approach. However, generative models within this field face a fundamental trade-off between the slow, iterative sampling of diffusion models and the architectural constraints of faster Flow-based methods, which often rely on explicit consistency losses. To address these limitations, we introduce MP1, which pairs 3D point-cloud inputs with the MeanFlow paradigm to generate action trajectories in one network function evaluation (1-NFE). By directly learning the interval-averaged velocity via the "MeanFlow Identity", our policy avoids any additional consistency constraints. This formulation eliminates numerical ODE-solver errors during inference, yielding more precise trajectories. MP1 further incorporates CFG for improved trajectory controllability while retaining 1-NFE inference without reintroducing structural constraints. Because subtle scene-context variations are critical for robot learning, especially in few-shot learning, we introduce a lightweight Dispersive Loss that repels state embeddings during training, boosting generalization without slowing inference. We validate our method on the Adroit and Meta-World benchmarks, as well as in real-world scenarios. Experimental results show MP1 achieves superior average task success rates, outperforming DP3 by 10.2% and FlowPolicy by 7.3%. Its average inference time is only 6.8 ms-19x faster than DP3 and nearly 2x faster than FlowPolicy. Our project page is available at https://mp1-2254.github.io/, and the code can be accessed at https://github.com/LogSSim/MP1.

Juyi Sheng, Ziyi Wang, Peiming Li, Mengyuan Liu• 2025

Related benchmarks

TaskDatasetResultRank
Robot ManipulationAdroit
Pen Task Score58
50
Robotic ManipulationAdroit and MetaWorld
Average Success Rate78.6
21
Robot ManipulationMetaWorld 50 tasks
Success Rate (Easy)88.2
21
3D pointcloud manipulationMetaWorld
Success Rate (Easy)88.2
17
Robotic ManipulationMeta-World
Success Rate (Easy)85.8
16
Robot Policy Inference EfficiencyNVIDIA RTX 4090 simulation (inference)
Inference Time (ms)4.1
12
Robot Policy Inference EfficiencyNVIDIA RTX 2080 physical robot deployment (inference)
Inference Time (ms)21.4
12
Tool-based ManipulationDexArt
DexArt Avg Success Rate61
11
Robot ManipulationAdroit 3 tasks
Hammer Success Rate100
10
3D pointcloud manipulationAdroit
Success Rate82
8
Showing 10 of 17 rows

Other info

Follow for update