Latent Multi-task Architecture Learning

About

Multi-task learning (MTL) allows deep neural networks to learn from related tasks by sharing parameters with other networks. In practice, however, MTL involves searching an enormous space of possible parameter sharing architectures to find (a) the layers or subspaces that benefit from sharing, (b) the appropriate amount of sharing, and (c) the appropriate relative weights of the different task losses. Recent work has addressed each of the above problems in isolation. In this work we present an approach that learns a latent multi-task architecture that jointly addresses (a)–(c). We present experiments on synthetic data and data from OntoNotes 5.0, including four different tasks and seven different domains. Our extension consistently outperforms previous approaches to learning latent architectures for multi-task problems and achieves up to 15% average error reductions over common approaches to MTL.
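To make the search space concrete, below is a minimal PyTorch sketch of how points (a)–(c) can be made learnable: two task-specific networks whose layer outputs are mixed by trainable sharing weights (a matrix called alpha here), plus trainable relative task-loss weights. All names (SharedLayerPair, LatentMTL, alpha, log_w) are illustrative assumptions rather than the authors' implementation; the paper's full model also learns sharing over layer subspaces and over mixtures of layer outputs.

```python
import torch
import torch.nn as nn

class SharedLayerPair(nn.Module):
    """One hidden layer per task, with a trainable 2x2 mixing matrix
    (alpha) deciding how much each task's representation flows into
    the other task's next layer -- points (a) and (b) in the abstract."""

    def __init__(self, dim):
        super().__init__()
        self.layer_a = nn.Linear(dim, dim)
        self.layer_b = nn.Linear(dim, dim)
        # Identity init = no sharing at the start; training adjusts it.
        self.alpha = nn.Parameter(torch.eye(2))

    def forward(self, h_a, h_b):
        out_a = torch.relu(self.layer_a(h_a))
        out_b = torch.relu(self.layer_b(h_b))
        mixed_a = self.alpha[0, 0] * out_a + self.alpha[0, 1] * out_b
        mixed_b = self.alpha[1, 0] * out_a + self.alpha[1, 1] * out_b
        return mixed_a, mixed_b

class LatentMTL(nn.Module):
    """Two-task network whose degree of sharing and relative loss
    weights are both ordinary trainable parameters."""

    def __init__(self, in_dim, hid_dim, n_classes_a, n_classes_b, n_layers=2):
        super().__init__()
        self.embed_a = nn.Linear(in_dim, hid_dim)
        self.embed_b = nn.Linear(in_dim, hid_dim)
        self.layers = nn.ModuleList(
            [SharedLayerPair(hid_dim) for _ in range(n_layers)]
        )
        self.head_a = nn.Linear(hid_dim, n_classes_a)
        self.head_b = nn.Linear(hid_dim, n_classes_b)
        # Trainable relative task-loss weights -- point (c).
        self.log_w = nn.Parameter(torch.zeros(2))

    def forward(self, x):
        h_a = torch.relu(self.embed_a(x))
        h_b = torch.relu(self.embed_b(x))
        for layer in self.layers:
            h_a, h_b = layer(h_a, h_b)
        return self.head_a(h_a), self.head_b(h_b)

    def combined_loss(self, loss_a, loss_b):
        # Softmax keeps the weights positive and summing to one.
        w = torch.softmax(self.log_w, dim=0)
        return w[0] * loss_a + w[1] * loss_b

# Example forward/backward pass on random data: gradients flow into
# the sharing matrix and the loss weights as well as the layers.
model = LatentMTL(in_dim=16, hid_dim=32, n_classes_a=5, n_classes_b=3)
x = torch.randn(8, 16)
logits_a, logits_b = model(x)
loss = model.combined_loss(
    nn.functional.cross_entropy(logits_a, torch.randint(0, 5, (8,))),
    nn.functional.cross_entropy(logits_b, torch.randint(0, 3, (8,))),
)
loss.backward()
```

Because the sharing matrix and the loss weights sit in the same computation graph as the layers, a single backward pass trains the architecture choices (a)–(c) jointly with the model, which is the core idea the abstract describes.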

Sebastian Ruder, Joachim Bingel, Isabelle Augenstein, Anders Søgaard • 2017

Related benchmarks

Task                       | Dataset                | Result                         | Rank
Semantic segmentation      | Cityscapes (test)      | mIoU 39.8                      | 1145
Depth Estimation           | NYU v2 (test)          | --                             | 423
Semantic segmentation      | NYU v2 (test)          | mIoU 40.8                      | 248
Surface Normal Estimation  | NYU v2 (test)          | Mean Angle Distance (MAD) 15.3 | 206
Depth Estimation           | NYU Depth V2           | --                             | 177
Surface Normal Prediction  | NYU V2                 | Mean Error 14.2                | 100
Semantic segmentation      | NYU V2                 | mIoU 23.8                      | 74
Multi-Task Adaptation      | Pascal Context (test)  | --                             | 70
Monocular Depth Estimation | Cityscapes             | Accuracy (delta < 1.25) 68.9   | 62
Multi-task Learning        | Cityscapes (test)      | --                             | 43

Showing 10 of 44 rows
