Event-based Vision meets Deep Learning on Steering Prediction for Self-driving Cars
About
Event cameras are bio-inspired vision sensors that naturally capture the dynamics of a scene, filtering out redundant information. This paper presents a deep neural network approach that unlocks the potential of event cameras on a challenging motion-estimation task: prediction of a vehicle's steering angle. To make the most of this sensor-algorithm combination, we adapt state-of-the-art convolutional architectures to the output of event sensors and extensively evaluate the performance of our approach on a publicly available large-scale event-camera dataset (~1000 km). We present qualitative and quantitative explanations of why event cameras allow robust steering prediction even in cases where traditional cameras fail, e.g., challenging illumination conditions and fast motion. Finally, we demonstrate the advantages of leveraging transfer learning from traditional to event-based vision, and show that our approach outperforms state-of-the-art algorithms based on standard cameras.
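One common way to feed an asynchronous event stream to a convolutional network is to accumulate the events from a short time window into an image-like array. The sketch below illustrates that idea and is not taken from the paper: the function name `events_to_frame`, the two-channel layout, and the polarity encoding (positive values for brightness increases) are all assumptions made for the example.

```python
import numpy as np


def events_to_frame(x, y, polarity, height, width):
    """Accumulate events into a 2-channel per-pixel count image.

    Hypothetical helper: channel 0 counts negative-polarity events,
    channel 1 counts positive-polarity events.
    """
    frame = np.zeros((2, height, width), dtype=np.float32)
    x = np.asarray(x, dtype=np.intp)
    y = np.asarray(y, dtype=np.intp)
    # Map polarity to a channel index: 1 for positive, 0 otherwise.
    channel = (np.asarray(polarity) > 0).astype(np.intp)
    # np.add.at accumulates correctly when the same pixel fires repeatedly.
    np.add.at(frame, (channel, y, x), 1.0)
    return frame


# Three events on a 2x3 sensor: two positive at (x=0, y=1), one negative at (x=2, y=0).
frame = events_to_frame([0, 0, 2], [1, 1, 0], [1, 1, -1], height=2, width=3)
```

An array of this shape can be passed to a standard image architecture once its first layer accepts two input channels. How the paper itself builds the network input is not stated in this abstract.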
Related benchmarks
| Task | Dataset | Metric | Value | Rank |
|---|---|---|---|---|
| Object Classification | N-CARS (test) | Accuracy | 86.1 | 53 |
| Optical Flow | MVSEC 1.0 (indoor_flying1) | EPE | 1.17 | 43 |
| Optical Flow | MVSEC 1.0 (indoor_flying2) | EPE | 1.97 | 37 |
| Optical Flow | MVSEC 1.0 (indoor_flying3) | EPE | 1.78 | 37 |
| Object Classification | N-Caltech101 | Accuracy | 71.3 | 9 |
| Semantic Segmentation | Ev-Seg 250ms interval (test) | Accuracy | 87.09 | 5 |
| Semantic Segmentation | Ev-Seg 50ms interval (test) | Accuracy | 88.85 | 5 |
| Semantic Segmentation | Ev-Seg 10ms interval (test) | Accuracy | 85.06 | 5 |
| Semantic Segmentation | Event Segmentation dataset (test) | Accuracy | 88.85 | 4 |