BEACON: Cross-Domain Co-Training of Generative Robot Policies via Best-Effort Adaptation
About
We introduce BEACON--Best-Effort Adaptation for Cross-Domain Co-Training--a theory-driven framework for training generative robot policies with abundant source demonstrations and limited target demonstrations. BEACON casts cross-domain co-training as a discrepancy-aware importance-reweighting problem, jointly learning a diffusion-based visuomotor policy and per-sample source weights that minimize an objective informed by target-domain generalization guarantees. To make best-effort adaptation practical for high-dimensional sequence policies, we develop scalable instance-level discrepancy estimators, stochastic alternating updates for policy and weights, and a multi-source extension that balances heterogeneous source domains. Across sim-to-sim, sim-to-real, and multi-source manipulation settings, BEACON improves robustness and data efficiency over target-only, fixed-ratio co-training, and feature-alignment baselines. Importantly, even without an explicit alignment objective, BEACON achieves feature alignment as an implicit result of discrepancy-aware cross-domain co-training.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Block-stacking | Sim-to-Real (P) | Success Rate (R)85 | 7 | |
| Block-stacking | Sim-to-Real P-OOD (evaluation) | Success Rate (R)45 | 7 | |
| Mug Cleanup | Sim-to-Real (P) | Success Rate80 | 7 | |
| Mug Cleanup | Sim-to-Real OOD P-OOD (out-of-distribution evaluation) | Success Rate60 | 7 | |
| Block-stacking | Sim-to-sim Block Stacking Texture Gap | Success Rate (R)89 | 6 | |
| Block-stacking | Sim-to-sim Block Stacking Texture + Viewpoint Gap | Success Rate (R)73 | 6 | |
| Mug Cleanup | Sim-to-sim Mug Cleanup Texture Gap | Success Rate51 | 6 | |
| Multi-task Robotic Manipulation | Sim-to-sim Texture Gap | Average Success Rate64 | 6 | |
| Multi-task Robotic Manipulation | Sim-to-sim Aggregate Performance Texture + Viewpoint Gap | Average Success Rate45 | 6 | |
| Threading | Sim-to-sim Threading Texture Gap | Success Rate53 | 6 |