Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Learning to drive from a world on rails

About

We learn an interactive vision-based driving policy from pre-recorded driving logs via a model-based approach. A forward model of the world supervises a driving policy that predicts the outcome of any potential driving trajectory. To support learning from pre-recorded logs, we assume that the world is on rails, meaning neither the agent nor its actions influence the environment. This assumption greatly simplifies the learning problem, factorizing the dynamics into a nonreactive world model and a low-dimensional and compact forward model of the ego-vehicle. Our approach computes action-values for each training trajectory using a tabular dynamic-programming evaluation of the Bellman equations; these action-values in turn supervise the final vision-based driving policy. Despite the world-on-rails assumption, the final driving policy acts well in a dynamic and reactive world. At the time of writing, our method ranks first on the CARLA leaderboard, attaining a 25% higher driving score while using 40 times less data. Our method is also an order of magnitude more sample-efficient than state-of-the-art model-free reinforcement learning techniques on navigational tasks in the ProcGen benchmark.

Dian Chen, Vladlen Koltun, Philipp Kr\"ahenb\"uhl• 2021

Related benchmarks

TaskDatasetResultRank
Autonomous DrivingLongest6
DS23.6
35
Autonomous DrivingCARLA 42 routes
Driving Score67.64
17
Autonomous DrivingLongest6 36 routes 1.0
DS0.1736
17
Autonomous DrivingCARLA Town05 (Long)
DS44.8
15
Autonomous DrivingCARLA Town05 (Short)
DS Score64.79
15
Autonomous DrivingCARLA Leaderboard official 1.0 (test)
Driving Score31
15
Autonomous DrivingCARLA Leaderboard May 2022 (public)
Driving Score31.37
11
Autonomous Drivingpublic CARLA leaderboard Nov 2022 (test)
Driving Score0.3137
11
Autonomous DrivingCARLA Jun 2022 (public leaderboard)
Driving Score0.3137
10
Autonomous DrivingLongest6 (train towns)
DS21
9
Showing 10 of 18 rows

Other info

Code

Follow for update