Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

LIBERO: Benchmarking Knowledge Transfer for Lifelong Robot Learning

About

Lifelong learning offers a promising paradigm of building a generalist agent that learns and adapts over its lifespan. Unlike traditional lifelong learning problems in image and text domains, which primarily involve the transfer of declarative knowledge of entities and concepts, lifelong learning in decision-making (LLDM) also necessitates the transfer of procedural knowledge, such as actions and behaviors. To advance research in LLDM, we introduce LIBERO, a novel benchmark of lifelong learning for robot manipulation. Specifically, LIBERO highlights five key research topics in LLDM: 1) how to efficiently transfer declarative knowledge, procedural knowledge, or the mixture of both; 2) how to design effective policy architectures and 3) effective algorithms for LLDM; 4) the robustness of a lifelong learner with respect to task ordering; and 5) the effect of model pretraining for LLDM. We develop an extendible procedural generation pipeline that can in principle generate infinitely many tasks. For benchmarking purpose, we create four task suites (130 tasks in total) that we use to investigate the above-mentioned research topics. To support sample-efficient learning, we provide high-quality human-teleoperated demonstration data for all tasks. Our extensive experiments present several insightful or even unexpected discoveries: sequential finetuning outperforms existing lifelong learning methods in forward transfer, no single visual encoder architecture excels at all types of knowledge transfer, and naive supervised pretraining can hinder agents' performance in the subsequent LLDM. Check the website at https://libero-project.github.io for the code and the datasets.

Bo Liu, Yifeng Zhu, Chongkai Gao, Yihao Feng, Qiang Liu, Yuke Zhu, Peter Stone• 2023

Related benchmarks

TaskDatasetResultRank
Robot ManipulationLIBERO
Goal Achievement84.6
700
Lifelong Imitation LearningLIBERO Goal
Forward Transfer (FWT)55
16
Continual LearningLIBERO Object
FWT62
15
Vision-Language-Action LearningLIBERO Multi-task scenario
FAR0.62
12
Continual LearningLIBERO LONG (test)
AUC22.37
11
Robotic ManipulationSimplerWidowX zero-shot
Spoon Success Rate12.5
11
Multi-task imitation learningLIBERO Long
Success Rate44.1
11
Lifelong Imitation LearningLIBERO Object
Forward Transfer (FWT)62
9
Lifelong Imitation LearningLIBERO-50
Forward Task Transfer (FWT)32
8
Robotic ManipulationLIBERO 130 tasks across five suites
LIBERO Object Success Rate78.9
8
Showing 10 of 15 rows

Other info

Code

Follow for update