Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Understanding and Improving Convolutional Neural Networks via Concatenated Rectified Linear Units

About

Recently, convolutional neural networks (CNNs) have been used as a powerful tool to solve many problems of machine learning and computer vision. In this paper, we aim to provide insight on the property of convolutional neural networks, as well as a generic method to improve the performance of many CNN architectures. Specifically, we first examine existing CNN models and observe an intriguing property that the filters in the lower layers form pairs (i.e., filters with opposite phase). Inspired by our observation, we propose a novel, simple yet effective activation scheme called concatenated ReLU (CRelu) and theoretically analyze its reconstruction property in CNNs. We integrate CRelu into several state-of-the-art CNN architectures and demonstrate improvement in their recognition performance on CIFAR-10/100 and ImageNet datasets with fewer trainable parameters. Our results suggest that better understanding of the properties of CNNs can lead to significant performance improvement with a simple modification.

Wenling Shang, Kihyuk Sohn, Diogo Almeida, Honglak Lee• 2016

Related benchmarks

TaskDatasetResultRank
Continual Supervised LearningContinual ImageNet
Total Average Online Task Accuracy84.85
49
Continual Supervised LearningCIFAR Random Label
Total Average Online Task Accuracy92.9
49
Continual Supervised LearningCIFAR 5+1
Total Average Online Task Accuracy20.56
49
Continual LearningPermuted MNIST--
32
Continual LearningMNIST Shuffled Labels
Accuracy (ACC)89.47
22
Plasticity MeasurementLocomotion Tasks Aggregate (Ant, HalfCheetah, Humanoid) (train)
Plasticity Score (IQM)12
17
Showing 6 of 6 rows

Other info

Follow for update