FIERY: Future Instance Prediction in Bird's-Eye View from Surround Monocular Cameras

About

Driving requires interacting with road agents and predicting their future behaviour in order to navigate safely. We present FIERY: a probabilistic future prediction model in bird's-eye view from monocular cameras. Our model predicts future instance segmentation and motion of dynamic agents that can be transformed into non-parametric future trajectories. Our approach combines the perception, sensor fusion and prediction components of a traditional autonomous driving stack by estimating bird's-eye-view prediction directly from surround RGB monocular camera inputs. FIERY learns to model the inherent stochastic nature of the future solely from camera driving data in an end-to-end manner, without relying on HD maps, and predicts multimodal future trajectories. We show that our model outperforms previous prediction baselines on the NuScenes and Lyft datasets. The code and trained models are available at https://github.com/wayveai/fiery.

Anthony Hu, Zak Murez, Nikhil Mohan, Sof\'ia Dudas, Jeffrey Hawke, Vijay Badrinarayanan, Roberto Cipolla, Alex Kendall• 2021

Related benchmarks

Task	Dataset	Result
Semantic segmentation	nuScenes (val)	--	323
LiDAR Semantic Segmentation	nuScenes official (test)	mIoU39.9	196
Instance-aware occupancy flow prediction	nuScenes (val)	IoU37	26
BeV Segmentation	nuScenes v1.0 (val)	Drivable Area71.97	25
Map Segmentation	nuScenes (val)	--	23
BeV Segmentation	nuScenes (val)	Vehicle Segmentation Score38	16
Vehicle Segmentation	nuScenes (val)	mIoU38.2	14
BeV vehicle segmentation	nuScenes (val)	--	11
Vehicle Segmentation	nuScenes Setting 1: 100m x 50m at 25cm resolution v1.0-trainval (val)	mIoU37.7	7
Vehicle Segmentation	nuScenes Setting 2: 100m x 100m at 50cm resolution v1.0-trainval (val)	mIoU35.8	7

Showing 10 of 21 rows

Other info

Code

Follow for update

@wizwand_team Discord