Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Edge-aware Guidance Fusion Network for RGB Thermal Scene Parsing

About

RGB thermal scene parsing has recently attracted increasing research interest in the field of computer vision. However, most existing methods fail to perform good boundary extraction for prediction maps and cannot fully use high level features. In addition, these methods simply fuse the features from RGB and thermal modalities but are unable to obtain comprehensive fused features. To address these problems, we propose an edge-aware guidance fusion network (EGFNet) for RGB thermal scene parsing. First, we introduce a prior edge map generated using the RGB and thermal images to capture detailed information in the prediction map and then embed the prior edge information in the feature maps. To effectively fuse the RGB and thermal information, we propose a multimodal fusion module that guarantees adequate cross-modal fusion. Considering the importance of high level semantic information, we propose a global information module and a semantic information module to extract rich semantic information from the high-level features. For decoding, we use simple elementwise addition for cascaded feature fusion. Finally, to improve the parsing accuracy, we apply multitask deep supervision to the semantic and boundary maps. Extensive experiments were performed on benchmark datasets to demonstrate the effectiveness of the proposed EGFNet and its superior performance compared with state of the art methods. The code and results can be found at https://github.com/ShaohuaDong2021/EGFNet.

Wujie Zhou, Shaohua Dong, Caie Xu, Yaguan Qian• 2021

Related benchmarks

TaskDatasetResultRank
Semantic segmentationMFNet (test)
mIoU54.8
134
Semantic segmentationPST900 (test)
mIoU78.51
72
Semantic segmentationFMB (test)
mIoU47.3
59
Semantic segmentationPST900
mIoU78.51
30
Semantic segmentationMFNet 1.0 (test)
Car IoU87.6
18
Scene ParsingPST900 (test)
mIoU78.5
11
RGB-T video semantic segmentationMVSeg
mIoU (%)53.4
9
Semantic segmentationMFNet (test)
Params (M)62.78
9
Showing 8 of 8 rows

Other info

Follow for update