Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Adding Attentiveness to the Neurons in Recurrent Neural Networks

About

Recurrent neural networks (RNNs) are capable of modeling the temporal dynamics of complex sequential information. However, the structures of existing RNN neurons mainly focus on controlling the contributions of current and historical information but do not explore the different importance levels of different elements in an input vector of a time slot. We propose adding a simple yet effective Element-wiseAttention Gate (EleAttG) to an RNN block (e.g., all RNN neurons in a network layer) that empowers the RNN neurons to have the attentiveness capability. For an RNN block, an EleAttG is added to adaptively modulate the input by assigning different levels of importance, i.e., attention, to each element/dimension of the input. We refer to an RNN block equipped with an EleAttG as an EleAtt-RNN block. Specifically, the modulation of the input is content adaptive and is performed at fine granularity, being element-wise rather than input-wise. The proposed EleAttG, as an additional fundamental unit, is general and can be applied to any RNN structures, e.g., standard RNN, Long Short-Term Memory (LSTM), or Gated Recurrent Unit (GRU). We demonstrate the effectiveness of the proposed EleAtt-RNN by applying it to the action recognition tasks on both 3D human skeleton data and RGB videos. Experiments show that adding attentiveness through EleAttGs to RNN blocks significantly boosts the power of RNNs.

Pengfei Zhang, Jianru Xue, Cuiling Lan, Wenjun Zeng, Zhanning Gao, Nanning Zheng• 2018

Related benchmarks

TaskDatasetResultRank
Skeleton-based Action RecognitionNTU RGB+D (Cross-subject)
Accuracy80.7
123
Skeleton-based Action RecognitionNTU 60 (X-view)
Accuracy88.4
119
Skeleton-based Action RecognitionNTU (Cross-Subject)
Accuracy80.7
86
Action RecognitionNorthwestern-UCLA (NUCLA) Multiview (cross-view)--
45
Skeleton-based Action RecognitionNTU RGB+D Cross-View (CV) 1.0
Accuracy88.4
38
Action RecognitionN-UCLA
Accuracy90.7
36
3D skeleton-based action recognitionSYSU CS
Accuracy85.7
20
Action RecognitionEGTEA Gaze+
Accuracy57.01
18
Action RecognitionSYSU-3D (cross-subject)
Accuracy85.7
14
Egocentric Activity RecognitionGTEA 61
Accuracy66.77
14
Showing 10 of 16 rows

Other info

Follow for update