Improving Multispectral Pedestrian Detection by Addressing Modality Imbalance Problems
About
Multispectral pedestrian detection is capable of adapting to insufficient illumination conditions by leveraging color-thermal modalities. On the other hand, it is still lacking of in-depth insights on how to fuse the two modalities effectively. Compared with traditional pedestrian detection, we find multispectral pedestrian detection suffers from modality imbalance problems which will hinder the optimization process of dual-modality network and depress the performance of detector. Inspired by this observation, we propose Modality Balance Network (MBNet) which facilitates the optimization process in a much more flexible and balanced manner. Firstly, we design a novel Differential Modality Aware Fusion (DMAF) module to make the two modalities complement each other. Secondly, an illumination aware feature alignment module selects complementary features according to the illumination conditions and aligns the two modality features adaptively. Extensive experimental results demonstrate MBNet outperforms the state-of-the-arts on both the challenging KAIST and CVC-14 multispectral pedestrian datasets in terms of the accuracy and the computational efficiency. Code is available at https://github.com/CalayZhou/MBNet.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Object Detection | DroneVehicle (test) | mAP5071.9 | 61 | |
| Multispectral Pedestrian Detection | KAIST (test) | Overall Score65.14 | 39 | |
| Pedestrian Detection | KAIST (test) | MR (Near)0.00e+0 | 22 | |
| Oriented Object Detection | DroneVehicle (test) | AP (Car)90.1 | 19 | |
| Pedestrian Detection | KAIST multispectral pedestrian dataset (All-dataset) | Detection Rate (Near)0.00e+0 | 14 | |
| Pedestrian Detection | CVC-14 (Night) | MR^-213.5 | 11 | |
| Pedestrian Detection | CVC-14 (Day) | MR^-224.7 | 11 | |
| Object Detection | KAIST (Reasonable-Day) | Miss Rate8.62 | 11 | |
| Object Detection | KAIST Reasonable (test) | Miss Rate8.4 | 11 | |
| Object Detection | KAIST (Reasonable-Night) | Miss Rate0.0827 | 11 |