Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Learned Two-Plane Perspective Prior based Image Resampling for Efficient Object Detection

About

Real-time efficient perception is critical for autonomous navigation and city scale sensing. Orthogonal to architectural improvements, streaming perception approaches have exploited adaptive sampling improving real-time detection performance. In this work, we propose a learnable geometry-guided prior that incorporates rough geometry of the 3D scene (a ground plane and a plane above) to resample images for efficient object detection. This significantly improves small and far-away object detection performance while also being more efficient both in terms of latency and memory. For autonomous navigation, using the same detector and scale, our approach improves detection rate by +4.1 $AP_{S}$ or +39% and in real-time performance by +5.3 $sAP_{S}$ or +63% for small objects over state-of-the-art (SOTA). For fixed traffic cameras, our approach detects small objects at image scales other methods cannot. At the same scale, our approach improves detection of small objects by 195% (+12.5 $AP_{S}$) over naive-downsampling and 63% (+4.2 $AP_{S}$) over SOTA.

Anurag Ghosh, N. Dinesh Reddy, Christoph Mertz, Srinivasa G. Narasimhan• 2023

Related benchmarks

TaskDatasetResultRank
2D Object DetectionArgoverse-HD (test)
AP30.8
22
Streaming Object DetectionArgoverse-HD streaming (test)
sAP30.7
16
Object DetectionBDD100K MOT dataset (test)
AP20.9
10
Object DetectionWALT Camera-Split (test)
AP36.4
8
Object DetectionCommuter Bus (test)
AP5084.5
5
Object TrackingArgoverse-HD (test)
MOTA44.6
4
Streaming Object DetectionArgoverse-HD
sAP30
4
Showing 7 of 7 rows

Other info

Follow for update