Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Universal Actions for Enhanced Embodied Foundation Models

About

Training on diverse, internet-scale data is a key factor in the success of recent large foundation models. Yet, using the same recipe for building embodied agents has faced noticeable difficulties. Despite the availability of many crowd-sourced embodied datasets, their action spaces often exhibit significant heterogeneity due to distinct physical embodiment and control interfaces for different robots, causing substantial challenges in developing embodied foundation models using cross-domain data. In this paper, we introduce UniAct, a new embodied foundation modeling framework operating in a Universal Action Space. Our learned universal actions capture the generic atomic behaviors across diverse robots by exploiting their shared structural features, and enable enhanced cross-domain data utilization and cross-embodiment generalizations by eliminating the notorious heterogeneity. The universal actions can be efficiently translated back to heterogeneous actionable commands by simply adding embodiment-specific details, from which fast adaptation to new robots becomes simple and straightforward. Our 0.5B instantiation of UniAct outperforms 14X larger SOTA embodied foundation models in extensive evaluations on various real-world and simulation robots, showcasing exceptional cross-embodiment control and adaptation capability, highlighting the crucial benefit of adopting universal actions. Project page: https://github.com/2toinf/UniAct

Jinliang Zheng, Jianxiong Li, Dongxiu Liu, Yinan Zheng, Zhihao Wang, Zhonghong Ou, Yu Liu, Jingjing Liu, Ya-Qin Zhang, Xianyuan Zhan• 2025

Related benchmarks

TaskDatasetResultRank
Robot ManipulationLIBERO
Goal Achievement77
700
Dexterous Hand ControlAdroit
Overall Avg Success Rate49
19
Language-conditioned manipulationLIBERO
Spatial Success Rate65
18
Robot ManipulationLIBERO (All four suites (combined))
Spatial Success Rate65
18
Dexterous Hand ManipulationDexArt
Success Rate55
12
Dexterous ManipulationBi-DexHands
Success Rate47
6
Dexterous ManipulationAdroit, DexArt, and Bi-DexHands
Average Success50
6
Universal NavigationMatterport3D 18-task suite LENL settings (averaged)
Average Success Rate50
4
Robotic ManipulationWidowX Evaluation Tasks
Visual Generalization Score6.8
3
Showing 9 of 9 rows

Other info

Code

Follow for update