
Deep Layer Aggregation

About

Visual recognition requires rich representations that span levels from low to high, scales from small to large, and resolutions from fine to coarse. Even with the depth of features in a convolutional network, a layer in isolation is not enough: compounding and aggregating these representations improves inference of what and where. Architectural efforts are exploring many dimensions for network backbones, designing deeper or wider architectures, but how to best aggregate layers and blocks across a network deserves further attention. Although skip connections have been incorporated to combine layers, these connections have been "shallow" themselves, and only fuse by simple, one-step operations. We augment standard architectures with deeper aggregation to better fuse information across layers. Our deep layer aggregation structures iteratively and hierarchically merge the feature hierarchy to make networks with better accuracy and fewer parameters. Experiments across architectures and tasks show that deep layer aggregation improves recognition and resolution compared to existing branching and merging schemes. The code is at https://github.com/ucbdrive/dla.
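The contrast between a one-step ("shallow") fusion and an iterative merge can be illustrated with a toy sketch. This is not the authors' implementation (their code is at the repository linked above). It covers only the iterative half of the idea, and every name in it (`toy_node`, `iterative_aggregate`, `one_step_aggregate`) is hypothetical. The aggregation node here is assumed to be a plain average followed by a ReLU, standing in for whatever learned layer a real network would use, and features are assumed to be equal-length lists of floats.

```python
from functools import reduce
from typing import Callable, List, Sequence

Feature = List[float]


def toy_node(a: Feature, b: Feature) -> Feature:
    """Hypothetical aggregation node: average two features, then apply ReLU.

    A stand-in for a learned fusion layer; chosen only to keep the sketch runnable.
    """
    if len(a) != len(b):
        raise ValueError("features must have the same length")
    return [max(0.0, (x + y) / 2.0) for x, y in zip(a, b)]


def iterative_aggregate(
    features: Sequence[Feature],
    node: Callable[[Feature, Feature], Feature] = toy_node,
) -> Feature:
    """Merge features in order, shallowest first: node(node(x1, x2), x3), ...

    Each merge result is fed into the next merge, so earlier features pass
    through more aggregation nodes than later ones.
    """
    if not features:
        raise ValueError("need at least one feature")
    return reduce(node, features)


def one_step_aggregate(features: Sequence[Feature]) -> Feature:
    """Shallow baseline: fuse all features in a single averaging step."""
    if not features:
        raise ValueError("need at least one feature")
    n = len(features)
    return [max(0.0, sum(column) / n) for column in zip(*features)]


if __name__ == "__main__":
    # Three toy "layers", ordered shallow to deep.
    layers = [[4.0, -2.0], [0.0, 2.0], [2.0, 6.0]]
    print(iterative_aggregate(layers))
    print(one_step_aggregate(layers))
```

With this toy node the two schemes give different outputs for the same inputs, because the iterative version refines the running result at every stage instead of fusing everything at once. The hierarchical (tree-structured) merging mentioned in the abstract is not covered by this sketch.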

Fisher Yu, Dequan Wang, Evan Shelhamer, Trevor Darrell • 2017

Related benchmarks

| Task | Dataset | Result | Rank |
| --- | --- | --- | --- |
| Semantic segmentation | Cityscapes (test) | -- | 1145 |
| Image Classification | Stanford Cars (test) | Accuracy 94.1 | 306 |
| Image Classification | CUB-200-2011 (test) | Top-1 Acc 85.1 | 276 |
| Image Classification | ImageNet (test) | -- | 235 |
| Image Classification | FGVC-Aircraft (test) | -- | 231 |
| Fine-grained Image Classification | CUB-200 2011 | Accuracy 85.1 | 222 |
| Fine-grained Image Classification | Stanford Cars | Accuracy 94.1 | 206 |
| Image Classification | FGVC Aircraft | Top-1 Accuracy 92.9 | 185 |
| Fine grained classification | Aircraft | Top-1 Acc 92.9 | 62 |
| Fine-grained visual classification | FGVC Aircraft | Top-1 Accuracy 92.6 | 41 |
Showing 10 of 14 rows
