Mitigating Task Interference in Multi-Task Learning via Explicit Task Routing with Non-Learnable Primitives

About

Multi-task learning (MTL) seeks to learn a single model to accomplish multiple tasks by leveraging shared information among the tasks. Existing MTL models, however, have been known to suffer from negative interference among tasks. Efforts to mitigate task interference have focused on either loss/gradient balancing or implicit parameter partitioning with partial overlaps among the tasks. In this paper, we propose ETR-NLP to mitigate task interference through a synergistic combination of non-learnable primitives (NLPs) and explicit task routing (ETR). Our key idea is to employ non-learnable primitives to extract a diverse set of task-agnostic features and recombine them into a shared branch common to all tasks and explicit task-specific branches reserved for each task. The non-learnable primitives and the explicit decoupling of learnable parameters into shared and task-specific ones afford the flexibility needed for minimizing task interference. We evaluate the efficacy of ETR-NLP networks for both image-level classification and pixel-level dense prediction MTL problems. Experimental results indicate that ETR-NLP significantly outperforms state-of-the-art baselines with fewer learnable parameters and similar FLOPs across all datasets. Code is available at this \href{https://github.com/zhichao-lu/etr-nlp-mtl}.

Chuntao Ding, Zhichao Lu, Shangguang Wang, Ran Cheng, Vishnu Naresh Boddeti• 2023

Related benchmarks

Task	Dataset	Result
Depth Estimation	NYU Depth V2	--	226
Facial Attribute Classification	CelebA	--	163
Surface Normal Prediction	NYU V2	Mean Error28.92	137
Depth Estimation	Cityscapes	Abs. Err.0.0136	65
Semantic segmentation	Cityscapes	mIoU61.49	33
Semantic segmentation	NYU V2	mIoU20.37	14
8 grouped facial attributes classification	CelebA	Precision0.727	7

Showing 7 of 7 rows

Other info

Follow for update

@wizwand_team Discord