Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

AdaFlow: Imitation Learning with Variance-Adaptive Flow-Based Policies

About

Diffusion-based imitation learning improves Behavioral Cloning (BC) on multi-modal decision-making, but comes at the cost of significantly slower inference due to the recursion in the diffusion process. It urges us to design efficient policy generators while keeping the ability to generate diverse actions. To address this challenge, we propose AdaFlow, an imitation learning framework based on flow-based generative modeling. AdaFlow represents the policy with state-conditioned ordinary differential equations (ODEs), which are known as probability flows. We reveal an intriguing connection between the conditional variance of their training loss and the discretization error of the ODEs. With this insight, we propose a variance-adaptive ODE solver that can adjust its step size in the inference stage, making AdaFlow an adaptive decision-maker, offering rapid inference without sacrificing diversity. Interestingly, it automatically reduces to a one-step generator when the action distribution is uni-modal. Our comprehensive empirical evaluation shows that AdaFlow achieves high performance with fast inference speed.

Xixi Hu, Bo Liu, Xingchao Liu, Qiang Liu• 2024

Related benchmarks

TaskDatasetResultRank
Robot ManipulationMetaWorld 50 tasks
Success Rate (Easy)50.4
21
Robot ManipulationMetaWorld Hard (6 tasks)
Success Rate12.6
18
Robot ManipulationMetaWorld Medium 11 tasks
Success Rate19.1
18
Robot ManipulationMeta-World
Latency (Easy) (ms)49.4
15
Robotic Arm ManipulationMetaWorld Very Hard
Success Rate32.3
15
Robot ManipulationMetaWorld Very Hard 5 tasks
Success Rate32.3
15
Robotic Arm ManipulationMetaWorld Easy
Success Rate50.6
15
LiftRoboMimic
Success Rate100
11
Robot ManipulationAdroit
Hammer Task Score45
11
Robot ManipulationAdroit 3 tasks
Hammer Success Rate45
10
Showing 10 of 15 rows

Other info

Follow for update