Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Where2comm: Communication-Efficient Collaborative Perception via Spatial Confidence Maps

About

Multi-agent collaborative perception could significantly upgrade the perception performance by enabling agents to share complementary information with each other through communication. It inevitably results in a fundamental trade-off between perception performance and communication bandwidth. To tackle this bottleneck issue, we propose a spatial confidence map, which reflects the spatial heterogeneity of perceptual information. It empowers agents to only share spatially sparse, yet perceptually critical information, contributing to where to communicate. Based on this novel spatial confidence map, we propose Where2comm, a communication-efficient collaborative perception framework. Where2comm has two distinct advantages: i) it considers pragmatic compression and uses less communication to achieve higher perception performance by focusing on perceptually critical areas; and ii) it can handle varying communication bandwidth by dynamically adjusting spatial areas involved in communication. To evaluate Where2comm, we consider 3D object detection in both real-world and simulation scenarios with two modalities (camera/LiDAR) and two agent types (cars/drones) on four datasets: OPV2V, V2X-Sim, DAIR-V2X, and our original CoPerception-UAVs. Where2comm consistently outperforms previous methods; for example, it achieves more than $100,000 \times$ lower communication volume and still outperforms DiscoNet and V2X-ViT on OPV2V. Our code is available at https://github.com/MediaBrain-SJTU/where2comm.

Yue Hu, Shaoheng Fang, Zixing Lei, Yiqi Zhong, Siheng Chen• 2022

Related benchmarks

TaskDatasetResultRank
3D Object DetectionOPV2V
AP@0.5096.01
146
3D Object DetectionDAIR-V2X
AP@0.5078.29
117
3D Object DetectionDAIR-V2X 14 (test)
AP@0.5080.7
107
3D Object DetectionV2XSet
AP@0.5090.1
70
3D Object DetectionOPV2V 13 (test)
AP@0.589.63
64
Object DetectionIRV2V
AP@0.5086.4
48
3D Object DetectionV2X-R (test)
3D mAP (IoU=0.3)92.2
47
3D Object DetectionV2X-R (val)
3D mAP (IoU=0.3)88.05
28
Object DetectionSynthetic multi-agent BEV perception environment (test)
AP23.04
26
3D Object DetectionDAIR-V2X 46 (test)
AP@0.576.13
21
Showing 10 of 56 rows

Other info

Code

Follow for update