Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Switchable Whitening for Deep Representation Learning

About

Normalization methods are essential components in convolutional neural networks (CNNs). They either standardize or whiten data using statistics estimated in predefined sets of pixels. Unlike existing works that design normalization techniques for specific tasks, we propose Switchable Whitening (SW), which provides a general form unifying different whitening methods as well as standardization methods. SW learns to switch among these operations in an end-to-end manner. It has several advantages. First, SW adaptively selects appropriate whitening or standardization statistics for different tasks (see Fig.1), making it well suited for a wide range of tasks without manual design. Second, by integrating benefits of different normalizers, SW shows consistent improvements over its counterparts in various challenging benchmarks. Third, SW serves as a useful tool for understanding the characteristics of whitening and standardization techniques. We show that SW outperforms other alternatives on image classification (CIFAR-10/100, ImageNet), semantic segmentation (ADE20K, Cityscapes), domain adaptation (GTA5, Cityscapes), and image style transfer (COCO). For example, without bells and whistles, we achieve state-of-the-art performance with 45.33% mIoU on the ADE20K dataset. Code is available at https://github.com/XingangPan/Switchable-Whitening.

Xingang Pan, Xiaohang Zhan, Jianping Shi, Xiaoou Tang, Ping Luo• 2019

Related benchmarks

TaskDatasetResultRank
Semantic segmentationADE20K (val)
mIoU45.33
2731
Semantic segmentationCityscapes (test)
mIoU29.91
1145
Semantic segmentationCityscapes
mIoU28.16
578
Semantic segmentationCityscapes (val)
mIoU29.91
572
Semantic segmentationCityscapes (val)
mIoU77.3
287
Crowd CountingShanghaiTech Part A (test)
MAE102.4
227
Crowd CountingShanghaiTech Part B (test)
MAE19
191
Semantic segmentationMapillary (val)
mIoU55.82
153
Semantic segmentationCityscapes 1.0 (val)
mIoU29.91
110
Semantic segmentationBDD-100K (val)
mIoU48.49
102
Showing 10 of 69 rows

Other info

Code

Follow for update