
Towards Faster Training of Global Covariance Pooling Networks by Iterative Matrix Square Root Normalization

About

Global covariance pooling in convolutional neural networks has achieved impressive improvements over classical first-order pooling. Recent work has shown that matrix square root normalization plays a central role in achieving state-of-the-art performance. However, existing methods depend heavily on eigendecomposition (EIG) or singular value decomposition (SVD), and their training is inefficient because EIG and SVD have limited support on GPU. To address this problem, we propose an iterative matrix square root normalization method for fast end-to-end training of global covariance pooling networks. At the core of our method is a meta-layer designed with a loop-embedded directed graph structure. The meta-layer consists of three consecutive nonlinear structured layers, which perform pre-normalization, coupled matrix iteration, and post-compensation, respectively. Our method is much faster than EIG- or SVD-based ones, since it involves only matrix multiplications, which are well suited to parallel implementation on GPU. Moreover, the proposed network with ResNet architecture converges in far fewer epochs, further accelerating network training. On large-scale ImageNet, we achieve competitive performance superior to existing counterparts. By fine-tuning our models pre-trained on ImageNet, we establish state-of-the-art results on three challenging fine-grained benchmarks. The source code and network models will be available at http://www.peihuali.org/iSQRT-COV
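The three stages named in the abstract (pre-normalization, coupled matrix iteration, post-compensation) can be illustrated with a short NumPy sketch. The abstract does not spell out the formulas, so the choices below are assumptions: pre-normalization divides by the trace, the coupled iteration is taken to be the Newton-Schulz iteration, and post-compensation multiplies by the square root of the trace. The function name `isqrt_cov_sketch` and the parameter `num_iters` are hypothetical, and this is not the authors' released implementation.

```python
import numpy as np


def isqrt_cov_sketch(sigma, num_iters=5):
    """Approximate the square root of a symmetric positive definite matrix.

    Assumed scheme (not confirmed by the abstract): trace pre-normalization,
    coupled Newton-Schulz iteration, trace-based post-compensation.
    Only matrix multiplications are used inside the loop.
    """
    dim = sigma.shape[0]
    identity = np.eye(dim)

    # Pre-normalization: scale so all eigenvalues lie in (0, 1],
    # which the Newton-Schulz iteration needs in order to converge.
    trace = np.trace(sigma)
    y = sigma / trace          # y converges to (sigma / trace)^(1/2)
    z = identity.copy()        # z converges to (sigma / trace)^(-1/2)

    # Coupled matrix iteration: y and z are updated together.
    for _ in range(num_iters):
        t = 0.5 * (3.0 * identity - z @ y)
        y = y @ t
        z = t @ z

    # Post-compensation: undo the scaling applied in pre-normalization.
    return np.sqrt(trace) * y


if __name__ == "__main__":
    # Hypothetical input: covariance of 64 random 8-dimensional features.
    rng = np.random.default_rng(0)
    features = rng.standard_normal((64, 8))
    centered = features - features.mean(axis=0, keepdims=True)
    sigma = centered.T @ centered / features.shape[0]

    root = isqrt_cov_sketch(sigma, num_iters=20)
    # Residual of root @ root against sigma; shrinks as num_iters grows.
    print(np.abs(root @ root - sigma).max())
```

A small, fixed number of iterations trades accuracy for speed; the sketch uses 20 in the demo only so that the residual is clearly small on a random input.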

Peihua Li, Jiangtao Xie, Qilong Wang, Zilin Gao • 2017

Related benchmarks

Task                                Dataset                Result                 Rank
Image Classification                ImageNet-1k (val)      --                     1453
Fine-grained Image Classification   CUB200 2011 (test)     Accuracy 88.7          536
Image Classification                DTD                    Accuracy 86.1          487
Image Classification                ImageNet (val)         Top-1 Accuracy 77.19   354
Fine-grained Image Classification   Stanford Cars (test)   Accuracy 93.3          348
Classification                      Cars                   Accuracy 92.4          314
Image Classification                Stanford Cars (test)   Accuracy 93.3          306
Fine-grained visual classification  FGVC-Aircraft (test)   Top-1 Acc 91.4         287
Image Classification                CUB-200-2011 (test)    Top-1 Acc 88.7         276
Image Classification                FGVC-Aircraft (test)   Accuracy 89.5          231
Showing 10 of 33 rows

