Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Adaptive Rational Activations to Boost Deep Reinforcement Learning

About

Latest insights from biology show that intelligence not only emerges from the connections between neurons but that individual neurons shoulder more computational responsibility than previously anticipated. This perspective should be critical in the context of constantly changing distinct reinforcement learning environments, yet current approaches still primarily employ static activation functions. In this work, we motivate why rationals are suitable for adaptable activation functions and why their inclusion into neural networks is crucial. Inspired by recurrence in residual networks, we derive a condition under which rational units are closed under residual connections and formulate a naturally regularised version: the recurrent-rational. We demonstrate that equipping popular algorithms with (recurrent-)rational activations leads to consistent improvements on Atari games, especially turning simple DQN into a solid approach, competitive to DDQN and Rainbow.

Quentin Delfosse, Patrick Schramowski, Martin Mundt, Alejandro Molina, Kristian Kersting• 2021

Related benchmarks

TaskDatasetResultRank
Continual Supervised LearningCIFAR Random Label
Total Average Online Task Accuracy94.82
49
Continual Supervised LearningCIFAR 5+1
Total Average Online Task Accuracy40.41
49
Continual Supervised LearningContinual ImageNet
Total Average Online Task Accuracy80.65
49
Continual LearningPermuted MNIST--
32
Continual LearningMNIST Shuffled Labels
Accuracy (ACC)92.35
22
Plasticity MeasurementLocomotion Tasks Aggregate (Ant, HalfCheetah, Humanoid) (train)
Plasticity Score (IQM)22.17
17
Reinforcement LearningAtari 2600 Enduro
Mean Score1.47e+3
10
Reinforcement LearningAtari 2600
Asterix Score242
7
Reinforcement LearningAtari 2600 Kangaroo
Score2.16e+3
7
Reinforcement LearningAtari 2600 TimePilot
Score6.41e+3
6
Showing 10 of 14 rows

Other info

Code

Follow for update