Exponential Moving Average Normalization for Self-supervised and Semi-supervised Learning
About
We present a plug-in replacement for batch normalization (BN) called exponential moving average normalization (EMAN), which improves the performance of existing student-teacher based self- and semi-supervised learning techniques. Unlike the standard BN, where the statistics are computed within each batch, EMAN, used in the teacher, updates its statistics by exponential moving average from the BN statistics of the student. This design reduces the intrinsic cross-sample dependency of BN and enhances the generalization of the teacher. EMAN improves strong baselines for self-supervised learning by 4-6/1-2 points and semi-supervised learning by about 7/2 points, when 1%/10% supervised labels are available on ImageNet. These improvements are consistent across methods, network architectures, training duration, and datasets, demonstrating the general effectiveness of this technique. The code is available at https://github.com/amazon-research/exponential-moving-average-normalization.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Image Classification | ImageNet 1% labeled | Top-5 Accuracy83.4 | 118 | |
| Image Classification | ImageNet (10% labels) | Top-1 Acc74 | 98 | |
| Image Classification | ImageNet 1k (10% labels) | Top-1 Acc74 | 92 | |
| KNN Classification | ImageNet-1k (val) | Top-1 Accuracy64.9 | 53 | |
| Image Classification | ImageNet 1k (1%) | Top-1 Acc63 | 49 | |
| Image Classification | ImageNet 10% label fraction 2012 (val) | Top-1 Acc72.8 | 18 | |
| Image Classification | ImageNet 10% labels 1K (val) | Top-5 Error88.5 | 18 | |
| Image Classification | ImageNet 1% labels 1k (val) | Top-1 Accuracy57.4 | 16 | |
| Category-level retrieval | ImageNet-1k (val) | mAP47.9 | 14 | |
| Image Classification | ImageNet-1k (val) | Top-1 Acc (1% labels)63 | 9 |