Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Continuously Differentiable Exponential Linear Units

About

Exponential Linear Units (ELUs) are a useful rectifier for constructing deep learning architectures, as they may speed up and otherwise improve learning by virtue of not have vanishing gradients and by having mean activations near zero. However, the ELU activation as parametrized in [1] is not continuously differentiable with respect to its input when the shape parameter alpha is not equal to 1. We present an alternative parametrization which is C1 continuous for all values of alpha, making the rectifier easier to reason about and making alpha easier to tune. This alternative parametrization has several other useful properties that the original parametrization of ELU does not: 1) its derivative with respect to x is bounded, 2) it contains both the linear transfer function and ReLU as special cases, and 3) it is scale-similar with respect to alpha.

Jonathan T. Barron• 2017

Related benchmarks

TaskDatasetResultRank
Continual Supervised LearningCIFAR 5+1
Total Average Online Task Accuracy54.23
49
Continual Supervised LearningContinual ImageNet
Total Average Online Task Accuracy81.15
49
Continual Supervised LearningCIFAR Random Label
Total Average Online Task Accuracy29.64
49
Continual LearningPermuted MNIST--
32
ClassificationEvaluation Benchmark (aggregated)
Accuracy78.14
27
Continual LearningMNIST Shuffled Labels
Accuracy (ACC)37.16
22
Plasticity MeasurementLocomotion Tasks Aggregate (Ant, HalfCheetah, Humanoid) (train)
Plasticity Score (IQM)14.31
17
Showing 7 of 7 rows

Other info

Follow for update