V2X-R: Cooperative LiDAR-4D Radar Fusion with Denoising Diffusion for 3D Object Detection
About
Current Vehicle-to-Everything (V2X) systems have significantly enhanced 3D object detection using LiDAR and camera data. However, these methods suffer from performance degradation in adverse weather conditions. The weather-robust 4D radar provides Doppler and additional geometric information, raising the possibility of addressing this challenge. To this end, we present V2X-R, the first simulated V2X dataset incorporating LiDAR, camera, and 4D radar. V2X-R contains 12,079 scenarios with 37,727 frames of LiDAR and 4D radar point clouds, 150,908 images, and 170,859 annotated 3D vehicle bounding boxes. Subsequently, we propose a novel cooperative LiDAR-4D radar fusion pipeline for 3D object detection and implement it with various fusion strategies. To achieve weather-robust detection, we additionally propose a Multi-modal Denoising Diffusion (MDD) module in our fusion pipeline. MDD utilizes weather-robust 4D radar feature as a condition to prompt the diffusion model to denoise noisy LiDAR features. Experiments show that our LiDAR-4D radar fusion pipeline demonstrates superior performance in the V2X-R dataset. Over and above this, our MDD module further improved the performance of basic fusion model by up to 5.73%/6.70% in foggy/snowy conditions with barely disrupting normal performance. The dataset and code will be publicly available at: https://github.com/ylwhxht/V2X-R.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| 3D Object Detection | OPV2V | AP@0.5080.07 | 146 | |
| 3D Object Detection | DAIR-V2X | AP@0.5074.95 | 117 | |
| 3D Object Detection | V2X-R (test) | -- | 47 | |
| 3D Object Detection | V2X-R (val) | -- | 28 | |
| 3D Object Detection | DAIR-V2Xn (Beam Missing) | AP@0.544.48 | 13 | |
| 3D Object Detection | DAIR-V2Xn Motion Blur | AP (IoU=0.5)66.47 | 13 | |
| 3D Object Detection | DAIR-V2Xn (Fog) | AP @ IoU=0.541.79 | 13 | |
| 3D Object Detection | DAIR-V2Xn Clean Data | AP@0.575.07 | 13 | |
| 3D Object Detection | DAIR-V2Xn (Echo) | AP@0.573.21 | 13 | |
| 3D Object Detection | DAIR-V2Xn (Water) | AP @ IoU=0.567.02 | 13 |