
MotionNet: Joint Perception and Motion Prediction for Autonomous Driving Based on Bird's Eye View Maps

About

The ability to reliably perceive the environmental states, particularly the existence of objects and their motion behavior, is crucial for autonomous driving. In this work, we propose an efficient deep model, called MotionNet, to jointly perform perception and motion prediction from 3D point clouds. MotionNet takes a sequence of LiDAR sweeps as input and outputs a bird's eye view (BEV) map, which encodes the object category and motion information in each grid cell. The backbone of MotionNet is a novel spatio-temporal pyramid network, which extracts deep spatial and temporal features in a hierarchical fashion. To enforce the smoothness of predictions over both space and time, the training of MotionNet is further regularized with novel spatial and temporal consistency losses. Extensive experiments show that the proposed method overall outperforms state-of-the-art approaches, including the latest scene-flow- and 3D-object-detection-based methods. This indicates the potential of the proposed method to serve as a backup to the bounding-box-based system and to provide complementary information to the motion planner in autonomous driving. Code is available at https://github.com/pxiangwu/MotionNet.
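As a rough illustration of the input representation the abstract describes, the sketch below rasterizes each LiDAR sweep into a binary bird's eye view occupancy grid and stacks the sweeps along time. This is not the authors' implementation; the grid extents, cell size, and number of height bins are illustrative assumptions, and the fake point clouds stand in for real, ego-motion-synchronized sweeps.

# Minimal BEV preprocessing sketch (assumed parameters, not the released code).
import numpy as np

def sweep_to_bev(points, x_range=(-32.0, 32.0), y_range=(-32.0, 32.0),
                 z_range=(-3.0, 2.0), cell=0.25, height_bins=13):
    """Rasterize one LiDAR sweep of shape (N, 3) into a binary BEV
    occupancy grid of shape (H, W, height_bins)."""
    h = int((x_range[1] - x_range[0]) / cell)
    w = int((y_range[1] - y_range[0]) / cell)
    grid = np.zeros((h, w, height_bins), dtype=np.float32)

    # Keep only points inside the crop region.
    mask = ((points[:, 0] >= x_range[0]) & (points[:, 0] < x_range[1]) &
            (points[:, 1] >= y_range[0]) & (points[:, 1] < y_range[1]) &
            (points[:, 2] >= z_range[0]) & (points[:, 2] < z_range[1]))
    p = points[mask]

    # Map metric coordinates to integer cell / height-bin indices.
    xi = ((p[:, 0] - x_range[0]) / cell).astype(int)
    yi = ((p[:, 1] - y_range[0]) / cell).astype(int)
    zi = ((p[:, 2] - z_range[0]) / (z_range[1] - z_range[0]) * height_bins).astype(int)
    zi = np.clip(zi, 0, height_bins - 1)
    grid[xi, yi, zi] = 1.0
    return grid

# Stack a short history of sweeps into the (T, H, W, C) tensor that the
# spatio-temporal backbone would consume; random points here are placeholders.
sweeps = [np.random.rand(1000, 3) * 10 - 5 for _ in range(5)]
bev_sequence = np.stack([sweep_to_bev(s) for s in sweeps], axis=0)
print(bev_sequence.shape)  # (5, 256, 256, 13)

Synchronizing the past sweeps into the current ego frame before rasterization keeps static structure aligned across time, which is what lets the network predict per-cell category and motion from the stacked grids.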

Pengxiang Wu, Siheng Chen, Dimitris Metaxas • 2020

Related benchmarks

Task                                       Dataset                          Result                        Rank
Instance-aware occupancy flow prediction   nuScenes (val)                   IoU: 37.2                     26
Motion Prediction                          nuScenes v1.0-trainval (test)    Mean Error: 0.0256            24
Motion Prediction                          nuScenes v1.0-trainval (test)    Mean Error: 0.2565            12
Motion Estimation                          nuScenes Static                  Mean Error: 0.0201            11
Motion Estimation                          nuScenes Speed <= 5m/s           Mean Error: 0.2292            11
Motion Prediction                          Waymo Dataset                    Mean Error (Static): 0.0263   4
