Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Adaptation-Free Heterogeneous Collaborative Perception with Unseen Agent Configurations

About

Collaborative perception improves 3D object detection by enabling agents to share complementary observations, but most existing methods assume fixed or known collaborator encoder configurations, limiting deployment in practice. In this work, we consider an open-world setting in which auxiliary agents with unseen configurations may appear after deployment, such as different LiDAR beam counts or encoder architectures. To address this challenge, we propose ALF, a collaborative perception framework that enables zero-adaptation collaboration with unseen agent configurations by lifting lightweight box-level messages into ego-compatible auxiliary features. ALF converts auxiliary box-level messages into pseudo-BEV maps and synthesizes ego-compatible latent features by combining object-centric cues with scene context from the ego feature. On V2X-Real, under a zero-shot evaluation across 64 case studies, ALF outperforms the strongest prior baseline by 35.91% in relative mAP@0.7 while requiring only 120 bytes per agent per frame (approximately 9.6 Kbps bandwidth at 10 Hz).

Hyunchul Bae, Heejin Ahn• 2026

Related benchmarks

TaskDatasetResultRank
3D Object DetectionV2X-Real EGO+AUX2 (unseen evaluation)
mAP@0.549.9
80
3D Object DetectionV2X-Real EGO+AUX1 1.0 (seen-pair)
mAP@0.552
80
3D Object DetectionV2X-Real EGO+AUX2 zero-shot 128-beam LiDAR
mAP@0.552
52
3D Object Detection(EGO+AUX1) 128-beam LiDAR (test)
mAP@0.552
52
Cooperative 3D Object DetectionV2X-Real (EGO+AUX1)--
16
Object DetectionV2X-Real EGO+AUX2 unseen adaptation 1.0 (test)--
16
Showing 6 of 6 rows

Other info

Follow for update