Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Pelican-VL 1.0: A Foundation Brain Model for Embodied Intelligence

About

This report presents Pelican-VL 1.0, a new family of open-source embodied brain models with parameter scales ranging from 7 billion to 72 billion. Our explicit mission is clearly stated as: To embed powerful intelligence into various embodiments. Pelican-VL 1.0 is currently the largest-scale open-source embodied multimodal brain model. Its core advantage lies in the in-depth integration of data power and intelligent adaptive learning mechanisms. Specifically, metaloop distilled a high-quality dataset from a raw dataset containing 4+ billion tokens. Pelican-VL 1.0 is trained on a large-scale cluster of 1000+ A800 GPUs, consuming over 50k+ A800 GPU-hours per checkpoint. This translates to a 20.3% performance uplift from its base model and outperforms 100B-level open-source counterparts by 10.6%, placing it on par with leading proprietary systems on well-known embodied benchmarks. We establish a novel framework, DPPO (Deliberate Practice Policy Optimization), inspired by human metacognition to train Pelican-VL 1.0. We operationalize this as a metaloop that teaches the AI to practice deliberately, which is a RL-Refine-Diagnose-SFT loop.

Yi Zhang, Che Liu, Xiancong Ren, Hanchu Ni, Shuai Zhang, Zeyuan Ding, Jiayu Hu, Hanzhe Shan, Zhenwei Niu, Zhaoyang Liu, Shuang Liu, Yue Zhao, Junbo Qi, Qinfan Zhang, Dengjie Li, Yidong Wang, Jiachen Luo, Yong Dai, Zenglin Xu, Bin Shen, Qifan Wang, Jian Tang, Xiaozhu Ju• 2025

Related benchmarks

TaskDatasetResultRank
Visual ReasoningBLINK
Accuracy56.8
76
Spatial ReasoningMindCube
Accuracy31
69
Spatial ReasoningEmbSpatial
Overall Accuracy73.2
63
Spatial ReasoningSITE
Accuracy52.3
39
Embodied Task CompletionEB-Habitat--
32
Embodied Reasoning and Question AnsweringERQA
Score39.8
30
Embodied Question AnsweringOpenEQA
Score63.3
21
Visual Question AnsweringAircopBench
Accuracy50.8
17
Visual Spatial IntelligenceVSI
Accuracy52.8
17
Spatial AptitudeSAT
Accuracy67.3
17
Showing 10 of 24 rows

Other info

Follow for update