Being-H0.5: Scaling Human-Centric Robot Learning for Cross-Embodiment Generalization

About

We introduce Being-H0.5, a foundational Vision-Language-Action (VLA) model designed for robust cross-embodiment generalization across diverse robotic platforms. While existing VLAs often struggle with morphological heterogeneity and data scarcity, we propose a human-centric learning paradigm that treats human interaction traces as a universal "mother tongue" for physical interaction. To support this, we present UniHand-2.0, the largest embodied pre-training recipe to date, comprising over 35,000 hours of multimodal data across 30 distinct robotic embodiments. Our approach introduces a Unified Action Space that maps heterogeneous robot controls into semantically aligned slots, enabling low-resource robots to bootstrap skills from human data and high-resource platforms. Built upon this human-centric foundation, we design a unified sequential modeling and multi-task pre-training paradigm to bridge human demonstrations and robotic execution. Architecturally, Being-H0.5 utilizes a Mixture-of-Transformers design featuring a novel Mixture-of-Flow (MoF) framework to decouple shared motor primitives from specialized embodiment-specific experts. Finally, to make cross-embodiment policies stable in the real world, we introduce Manifold-Preserving Gating for robustness under sensory shift and Universal Async Chunking to universalize chunked control across embodiments with different latency and control profiles. We empirically demonstrate that Being-H0.5 achieves state-of-the-art results on simulated benchmarks, such as LIBERO (98.9%) and RoboCasa (53.9%), while also exhibiting strong cross-embodiment capabilities on five robotic platforms.
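As a rough illustration of the Unified Action Space idea described above, the sketch below places embodiment-specific action vectors into one fixed-width vector with named slots, plus a mask marking which slots a given robot actually uses. The slot names, dimensions, and the helper functions `to_unified` and `from_unified` are hypothetical and chosen only for illustration. They are not taken from the paper or its code.

```python
from typing import Dict, List, Tuple

# Hypothetical slot layout (an assumption, not the paper's):
# slot name -> (start, end) index range in the unified action vector.
SLOTS: Dict[str, Tuple[int, int]] = {
    "arm_eef_pose": (0, 6),    # 6-DoF end-effector delta
    "gripper": (6, 7),         # parallel-gripper open/close
    "hand_fingers": (7, 13),   # dexterous-hand finger joints
}
UNIFIED_DIM = 13


def to_unified(action: Dict[str, List[float]]) -> Tuple[List[float], List[int]]:
    """Place per-slot values into the unified vector; unused slots stay zero and masked out."""
    vec = [0.0] * UNIFIED_DIM
    mask = [0] * UNIFIED_DIM
    for name, values in action.items():
        start, end = SLOTS[name]
        if len(values) != end - start:
            raise ValueError(f"slot {name!r} expects {end - start} values, got {len(values)}")
        vec[start:end] = list(values)
        mask[start:end] = [1] * (end - start)
    return vec, mask


def from_unified(vec: List[float], slot_names: List[str]) -> Dict[str, List[float]]:
    """Read back only the slots that a given embodiment actually controls."""
    return {name: vec[SLOTS[name][0]:SLOTS[name][1]] for name in slot_names}


# A gripper arm and a dexterous hand share the "arm_eef_pose" slot,
# so data from one can supervise the same dimensions for the other.
gripper_vec, gripper_mask = to_unified({"arm_eef_pose": [0.1] * 6, "gripper": [1.0]})
hand_vec, hand_mask = to_unified({"arm_eef_pose": [0.1] * 6, "hand_fingers": [0.5] * 6})
```

Under this assumed layout, a training loss would typically be computed only where the mask is 1, so each embodiment contributes gradients just for the slots it controls.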

Hao Luo, Ye Wang, Wanpeng Zhang, Sipeng Zheng, Ziheng Xi, Chaoyi Xu, Haiweng Xu, Haoqi Yuan, Chi Zhang, Yiqing Wang, Yicheng Feng, Zongqing Lu • 2026

Related benchmarks

Task	Dataset	Metric	Result	
Robot Manipulation	LIBERO	Object Achievement	99.6	1025
Robotic Manipulation	LIBERO-Plus	Language Understanding Score	81.8	414
Robotic Manipulation	LIBERO	Long-horizon Success Rate	96.6	165
Robotic Manipulation	LIBERO v1 (test)	Average Success Rate	98.9	118
Robotic Manipulation	RoboCasa	Average Success Rate	53.9	68
Robotic Manipulation	LIBERO 1.0 (test)	Long	96.2	57
Robot Manipulation	RoboCasa-GR1	Average Success Rate	53.3	20
Tabletop manipulation	LIBERO	Success Rate	98.9	17
Long-horizon household tasks	RoboCasa 24-task benchmark Human-50 few-shot	Pick & Place Success Rate	40	9
Manipulation	Calvin ABC->D	Avg. Completed Tasks	4.48	8

Showing 10 of 15 rows
