Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

A Generalist Dynamics Model for Control

About

We investigate the use of transformer sequence models as dynamics models (TDMs) for control. We find that TDMs exhibit strong generalization capabilities to unseen environments, both in a few-shot setting, where a generalist TDM is fine-tuned with small amounts of data from the target environment, and in a zero-shot setting, where a generalist TDM is applied to an unseen environment without any further training. Here, we demonstrate that generalizing system dynamics can work much better than generalizing optimal behavior directly as a policy. Additional results show that TDMs also perform well in a single-environment learning setting when compared to a number of baseline models. These properties make TDMs a promising ingredient for a foundation model of control.

Ingmar Schubert, Jingwei Zhang, Jake Bruce, Sarah Bechtle, Emilio Parisotto, Martin Riedmiller, Jost Tobias Springenberg, Arunkumar Byravan, Leonard Hasenclever, Nicolas Heess• 2023

Related benchmarks

TaskDatasetResultRank
Downstream model-based controlWalker2d OpenAI Gym (test)
Accumulated Reward207.6
8
Downstream model-based controlHopper OpenAI Gym (test)
Accumulated Reward242.3
8
Downstream model-based controlGo1 Unitree (test)
Accumulated Reward0.03
8
Dynamics PredictionWalker2D
MAE20.122
4
Dynamics PredictionHopper
MAE17.634
4
Trajectory PredictionUnitree A1 quadruped locomotion (test)
MAE10.624
4
Dynamics PredictionFranka
MAE23.686
4
Trajectory PredictionCassie bipedal jumping (test)
MAE14.51
4
Trajectory PredictionUR5 tabletop manipulation (test)
MAE18.578
4
Showing 9 of 9 rows

Other info

Follow for update