Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

MSG: Multi-Stream Generative Policies for Sample-Efficient Robotic Manipulation

About

Generative robot policies such as Flow Matching offer flexible, multi-modal policy learning but are sample-inefficient. Although object-centric policies improve sample efficiency, it does not resolve this limitation. In this work, we propose Multi-Stream Generative Policy (MSG), an inference-time composition framework that trains multiple object-centric policies and combines them at inference to improve generalization and sample efficiency. MSG is model-agnostic and inference-only, hence widely applicable to various generative policies and training paradigms. We perform extensive experiments both in simulation and on a real robot, demonstrating that our approach learns high-quality generative policies from as few as five demonstrations, resulting in a 95% reduction in demonstrations, and improves policy performance by 89 percent compared to single-stream approaches. Furthermore, we present comprehensive ablation studies on various composition strategies and provide practical recommendations for deployment. Finally, MSG enables zero-shot object instance transfer. We make our code publicly available at https://msg.cs.uni-freiburg.de.

Jan Ole von Hartz, Lukas Schweizer, Joschka Boedecker, Abhinav Valada• 2025

Related benchmarks

TaskDatasetResultRank
open drawerReal-World (test)
Success Rate96
11
Robot ManipulationRLBench 10 demonstrations (test)
Open Drawer Success Rate97
7
Pick-&-PlaceReal-World (test)
Success Rate76
4
Pour DrinkReal-World (test)
Success Rate84
4
Store in DrawerReal-World (test)
Success Rate68
4
Sweep BlocksReal-World (test)
Success Rate88
4
Showing 6 of 6 rows

Other info

Follow for update