CoopDiff: A Diffusion-Guided Approach for Cooperation under Corruptions
About
Cooperative perception lets agents share information to expand coverage and improve scene understanding. However, in real-world scenarios, diverse and unpredictable corruptions undermine its robustness and generalization. To address these challenges, we introduce CoopDiff, a diffusion-based cooperative perception framework that mitigates corruptions via a denoising mechanism. CoopDiff adopts a teacher-student paradigm: the Quality-Aware Teacher performs voxel-level early fusion with Quality of Interest weighting and semantic guidance, then produces clean supervision features via a diffusion denoiser. The Dual-Branch Diffusion Student first encodes the ego and cooperative streams separately to reconstruct the teacher's clean targets. An Ego-Guided Cross-Attention mechanism then adaptively integrates ego and cooperative features, enabling balanced decoding under degradation. We evaluate CoopDiff on two constructed multi-degradation benchmarks, OPV2Vn and DAIR-V2Xn, each incorporating six corruption types that span environmental and sensor-level distortions. Benefiting from the inherent denoising properties of diffusion, CoopDiff consistently outperforms prior methods across all degradation types and lowers the relative corruption error. Furthermore, it offers a tunable balance between precision and inference efficiency.
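The abstract does not spell out how the Ego-Guided Cross-Attention is built. As a rough illustration only, the sketch below assumes a generic single-head scaled dot-product cross-attention in which ego features act as queries and cooperative features act as keys and values, with a residual connection that preserves the ego signal. The function name `ego_guided_cross_attention`, the absence of learned projections, and the residual form are all assumptions made for illustration, not the authors' design.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def ego_guided_cross_attention(ego, coop):
    """Hypothetical sketch: ego features query cooperative features.

    ego:  list of N feature vectors (queries), each of length d
    coop: list of M feature vectors (keys and values), each of length d
    Returns N fused vectors of length d.
    """
    d = len(ego[0])
    fused = []
    for q in ego:
        # Scaled dot-product similarity between one ego query and every coop key.
        scores = [sum(qi * ki for qi, ki in zip(q, k)) / math.sqrt(d) for k in coop]
        weights = softmax(scores)
        # Attention-weighted sum of the cooperative values.
        attended = [sum(w * v[j] for w, v in zip(weights, coop)) for j in range(d)]
        # Assumed residual connection: the ego feature is kept and enriched.
        fused.append([qi + ai for qi, ai in zip(q, attended)])
    return fused


ego_feats = [[1.0, 0.0]]
coop_feats = [[1.0, 0.0], [1.0, 0.0]]
fused_feats = ego_guided_cross_attention(ego_feats, coop_feats)
```

Because the queries come from the ego stream, the attention weights express how relevant each cooperative feature is from the ego agent's point of view. A real implementation would add learned query, key, and value projections and operate on batched tensors.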
Related benchmarks
| Task | Dataset | Metric | Result | Rank |
|---|---|---|---|---|
| Collaborative 3D Object Detection | OPV2V | AP@0.5 | 90.53 | 20 |
| 3D Object Detection | OPV2Vn Clean Data | AP@0.5 | 90.53 | 13 |
| 3D Object Detection | OPV2Vn Beam Missing | AP@0.5 | 79.09 | 13 |
| 3D Object Detection | OPV2Vn Motion Blur | AP@0.5 | 81.42 | 13 |
| 3D Object Detection | OPV2Vn Fog | AP@0.5 | 68.71 | 13 |
| 3D Object Detection | OPV2Vn EMI | AP@0.5 | 78.91 | 13 |
| 3D Object Detection | OPV2Vn Water | AP@0.5 | 87.45 | 13 |
| 3D Object Detection | OPV2Vn Echo | AP@0.5 | 89.23 | 13 |
| 3D Object Detection | DAIR-V2Xn Clean Data | AP@0.5 | 80.69 | 13 |
| 3D Object Detection | DAIR-V2Xn (Beam Missing) | AP@0.5 | 52.43 | 13 |
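
The abstract mentions a relative corruption error without defining it. The sketch below assumes a definition common in robustness benchmarks, namely the relative drop in AP from clean to corrupted data, (AP_clean - AP_corrupt) / AP_clean, and applies it to the OPV2Vn AP@0.5 values listed in the table above. The paper may use a different formula, so the resulting numbers are illustrative and not reported results.

```python
def relative_corruption_error(ap_clean, ap_corrupt):
    """Assumed definition: fraction of clean AP lost under a corruption."""
    return (ap_clean - ap_corrupt) / ap_clean


# AP@0.5 values for OPV2Vn, taken from the benchmark table.
OPV2VN_CLEAN_AP = 90.53
OPV2VN_CORRUPT_AP = {
    "Beam Missing": 79.09,
    "Motion Blur": 81.42,
    "Fog": 68.71,
    "EMI": 78.91,
    "Water": 87.45,
    "Echo": 89.23,
}

# Per-corruption relative drop, plus the mean over the six corruption types.
rce = {
    name: relative_corruption_error(OPV2VN_CLEAN_AP, ap)
    for name, ap in OPV2VN_CORRUPT_AP.items()
}
mean_rce = sum(rce.values()) / len(rce)

# Print from most to least damaging corruption.
for name, value in sorted(rce.items(), key=lambda item: item[1], reverse=True):
    print(f"{name}: {value:.1%}")
print(f"Mean: {mean_rce:.1%}")
```

Under this assumed definition, fog is the most damaging of the listed OPV2Vn corruptions and echo the least.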