GR00T N1: An Open Foundation Model for Generalist Humanoid Robots

About

General-purpose robots need a versatile body and an intelligent mind. Recent advancements in humanoid robots have shown great promise as a hardware platform for building generalist autonomy in the human world. A robot foundation model, trained on massive and diverse data sources, is essential for enabling the robots to reason about novel situations, robustly handle real-world variability, and rapidly learn new tasks. To this end, we introduce GR00T N1, an open foundation model for humanoid robots. GR00T N1 is a Vision-Language-Action (VLA) model with a dual-system architecture. The vision-language module (System 2) interprets the environment through vision and language instructions. The subsequent diffusion transformer module (System 1) generates fluid motor actions in real time. Both modules are tightly coupled and jointly trained end-to-end. We train GR00T N1 with a heterogeneous mixture of real-robot trajectories, human videos, and synthetically generated datasets. We show that our generalist robot model GR00T N1 outperforms the state-of-the-art imitation learning baselines on standard simulation benchmarks across multiple robot embodiments. Furthermore, we deploy our model on the Fourier GR-1 humanoid robot for language-conditioned bimanual manipulation tasks, achieving strong performance with high data efficiency.

NVIDIA: Johan Bjorck, Fernando Casta\~neda, Nikita Cherniadev, Xingye Da, Runyu Ding, Linxi "Jim" Fan, Yu Fang, Dieter Fox, Fengyuan Hu, Spencer Huang, Joel Jang, Zhenyu Jiang, Jan Kautz, Kaushil Kundalia, Lawrence Lao, Zhiqi Li, Zongyu Lin, Kevin Lin, Guilin Liu, Edith Llontop, Loic Magne, Ajay Mandlekar, Avnish Narayan, Soroush Nasiriany, Scott Reed, You Liang Tan, Guanzhi Wang, Zu Wang, Jing Wang, Qi Wang, Jiannan Xiang, Yuqi Xie, Yinzhen Xu, Zhenjia Xu, Seonghyeon Ye, Zhiding Yu, Ao Zhang, Hao Zhang, Yizhou Zhao, Ruijie Zheng, Yuke Zhu• 2025

Related benchmarks

Task	Dataset	Result
Robot Manipulation	LIBERO	Object Achievement99.4	1025
Robotic Manipulation	LIBERO	Spatial Success Rate97.7	570
Robotic Manipulation	LIBERO-Plus	Language Understanding Score80.1	414
Robot Manipulation	LIBERO (test)	Average Success Rate93.9	237
Robot Manipulation	LIBERO	Spatial Success Rate97.7	223
Robotic Manipulation	LIBERO	Long-horizon Success Rate91	165
Long-horizon robot manipulation	Calvin ABCD→D	Task 1 Completion Rate89	140
Robot Manipulation	SimplerEnv WidowX	Overall Success Rate61.9	123
Robotic Manipulation	LIBERO v1 (test)	Average Success Rate93.9	118
Robot Manipulation	SimplerEnv Google Robot tasks Variant Aggregation	Average Success Rate51.5	109

Showing 10 of 323 rows

...

Other info

Follow for update

@wizwand_team Discord