Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Learning long-range spatial dependencies with horizontal gated-recurrent units

About

Progress in deep learning has spawned great successes in many engineering applications. As a prime example, convolutional neural networks, a type of feedforward neural networks, are now approaching -- and sometimes even surpassing -- human accuracy on a variety of visual recognition tasks. Here, however, we show that these neural networks and their recent extensions struggle in recognition tasks where co-dependent visual features must be detected over long spatial ranges. We introduce the horizontal gated-recurrent unit (hGRU) to learn intrinsic horizontal connections -- both within and across feature columns. We demonstrate that a single hGRU layer matches or outperforms all tested feedforward hierarchical baselines including state-of-the-art architectures which have orders of magnitude more free parameters. We further discuss the biological plausibility of the hGRU in comparison to anatomical data from the visual cortex as well as human behavioral data on a classic contour detection task.

Drew Linsley, Junkyung Kim, Vijay Veerabadran, Thomas Serre• 2018

Related benchmarks

TaskDatasetResultRank
Visual ReasoningMazes Mixed
Accuracy99.69
7
Maze SolvingMazes-19
Accuracy (Mazes-19)50.26
7
Maze SolvingMazes-25
Accuracy21.36
7
Path FindingPathFinder-21
Accuracy64.21
7
Path FindingPathFinder 24
Accuracy58.35
7
Visual ReasoningPathFinder Mixed
Accuracy89.66
7
Showing 6 of 6 rows

Other info

Follow for update