Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

RC-GeoCP: Geometric Consensus for Radar-Camera Collaborative Perception

About

Collaborative perception (CP) enhances scene understanding through multi-agent information sharing. While LiDAR-centric systems offer precise geometry, high costs and performance degradation in adverse weather necessitate multi-modal alternatives. Despite dense visual semantics and robust spatial measurements, the synergy between cameras and 4D radar remains underexplored in collaborative settings. This work introduces RC-GeoCP, the first framework to explore the fusion of 4D radar and images in CP. To resolve misalignment caused by depth ambiguity and spatial dispersion across agents, RC-GeoCP establishes a radar-anchored geometric consensus. Specifically, Geometric Structure Rectification (GSR) aligns visual semantics with geometry derived from radar to generate spatially grounded, geometry-consistent representations. Uncertainty-Aware Communication (UAC) formulates selective transmission as a conditional entropy reduction process to prioritize informative features based on inter-agent disagreement. Finally, the Consensus-Driven Assembler (CDA) aggregates multi-agent information via shared geometric anchors to form a globally coherent representation. We establish the first unified radar-camera CP benchmark on V2X-Radar and V2X-R, demonstrating state-of-the-art performance with significantly reduced communication overhead. Code will be released soon.

Xiaokai Bai, Lianqing Zheng, Runwei Guan, Siyuan Cao, Huiliang Shen• 2026

Related benchmarks

TaskDatasetResultRank
Collaborative PerceptionV2X-Radar (val)
AP@0.5044.55
10
Collaborative PerceptionV2X-Radar (test)
AP@0.5042.61
10
Collaborative PerceptionV2X-R (val)
AP@0.5081.9
10
Showing 3 of 3 rows

Other info

Follow for update