Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

CAR: Class-aware Regularizations for Semantic Segmentation

About

Recent segmentation methods, such as OCR and CPNet, utilizing "class level" information in addition to pixel features, have achieved notable success for boosting the accuracy of existing network modules. However, the extracted class-level information was simply concatenated to pixel features, without explicitly being exploited for better pixel representation learning. Moreover, these approaches learn soft class centers based on coarse mask prediction, which is prone to error accumulation. In this paper, aiming to use class level information more effectively, we propose a universal Class-Aware Regularization (CAR) approach to optimize the intra-class variance and inter-class distance during feature learning, motivated by the fact that humans can recognize an object by itself no matter which other objects it appears with. Three novel loss functions are proposed. The first loss function encourages more compact class representations within each class, the second directly maximizes the distance between different class centers, and the third further pushes the distance between inter-class centers and pixels. Furthermore, the class center in our approach is directly generated from ground truth instead of from the error-prone coarse prediction. Our method can be easily applied to most existing segmentation models during training, including OCR and CPNet, and can largely improve their accuracy at no additional inference overhead. Extensive experiments and ablation studies conducted on multiple benchmark datasets demonstrate that the proposed CAR can boost the accuracy of all baseline models by up to 2.23% mIOU with superior generalization ability. The complete code is available at https://github.com/edwardyehuang/CAR.

Ye Huang, Di Kang, Liang Chen, Xuefei Zhe, Wenjing Jia, Xiangjian He, Linchao Bao• 2022

Related benchmarks

TaskDatasetResultRank
Semantic segmentationADE20K
mIoU5.4
567
Semantic segmentationPascal Context (test)--
223
Semantic segmentationPascal Context--
217
Semantic segmentationPascal Context 59--
79
Semantic segmentationPASCAL VOC with background category VOC21 2012
mIoU48.6
51
Semantic segmentationCOCO-Stuff-10K (test)--
47
Semantic segmentationPascal Context 60 with background
mIoU13.6
43
Semantic segmentationPascal VOC without background 2012 V20
mIoU73.7
42
Semantic segmentationCOCO-Object with background class
mIoU15.4
34
Semantic segmentationPascal Context 59 (PC59) without background
mIoU18.4
20
Showing 10 of 10 rows

Other info

Code

Follow for update