Empirical Evaluation of Rectified Activations in Convolutional Network

About

In this paper we investigate the performance of different types of rectified activation functions in convolutional neural network: standard rectified linear unit (ReLU), leaky rectified linear unit (Leaky ReLU), parametric rectified linear unit (PReLU) and a new randomized leaky rectified linear units (RReLU). We evaluate these activation function on standard image classification task. Our experiments suggest that incorporating a non-zero slope for negative part in rectified activation units could consistently improve the results. Thus our findings are negative on the common belief that sparsity is the key of good performance in ReLU. Moreover, on small scale dataset, using deterministic negative slope or learning it are both prone to overfitting. They are not as effective as using their randomized counterpart. By using RReLU, we achieved 75.68\% accuracy on CIFAR-100 test set without multiple test or ensemble.

Bing Xu, Naiyan Wang, Tianqi Chen, Mu Li• 2015

Related benchmarks

Task	Dataset	Result
Image Classification	CIFAR100 (test)	--	147
Image Classification	CIFAR10 (test)	Error Rate3.84	80
Continual Supervised Learning	CIFAR Random Label	Total Average Online Task Accuracy98.02	49
Continual Supervised Learning	Continual ImageNet	Total Average Online Task Accuracy84.97	49
Continual Supervised Learning	CIFAR 5+1	Total Average Online Task Accuracy53.6	49
Continual Learning	Permuted MNIST	--	32
Continual Learning	MNIST Shuffled Labels	Accuracy (ACC)93.1	22
Plasticity Measurement	Locomotion Tasks Aggregate (Ant, HalfCheetah, Humanoid) (train)	Plasticity Score (IQM)22.55	17

Showing 8 of 8 rows

Other info

Code

Follow for update

@wizwand_team Discord