Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Mitigating Task Interference in Multi-Task Learning via Explicit Task Routing with Non-Learnable Primitives

About

Multi-task learning (MTL) seeks to learn a single model to accomplish multiple tasks by leveraging shared information among the tasks. Existing MTL models, however, have been known to suffer from negative interference among tasks. Efforts to mitigate task interference have focused on either loss/gradient balancing or implicit parameter partitioning with partial overlaps among the tasks. In this paper, we propose ETR-NLP to mitigate task interference through a synergistic combination of non-learnable primitives (NLPs) and explicit task routing (ETR). Our key idea is to employ non-learnable primitives to extract a diverse set of task-agnostic features and recombine them into a shared branch common to all tasks and explicit task-specific branches reserved for each task. The non-learnable primitives and the explicit decoupling of learnable parameters into shared and task-specific ones afford the flexibility needed for minimizing task interference. We evaluate the efficacy of ETR-NLP networks for both image-level classification and pixel-level dense prediction MTL problems. Experimental results indicate that ETR-NLP significantly outperforms state-of-the-art baselines with fewer learnable parameters and similar FLOPs across all datasets. Code is available at this \href{https://github.com/zhichao-lu/etr-nlp-mtl}.

Chuntao Ding, Zhichao Lu, Shangguang Wang, Ran Cheng, Vishnu Naresh Boddeti• 2023

Related benchmarks

TaskDatasetResultRank
Depth EstimationNYU Depth V2--
177
Facial Attribute ClassificationCelebA--
163
Surface Normal PredictionNYU V2
Mean Error28.92
100
Depth EstimationCityscapes
Abs. Err.0.0136
22
Semantic segmentationNYU V2
mIoU20.37
14
Semantic segmentationCityscapes
mIoU61.49
8
8 grouped facial attributes classificationCelebA
Precision0.727
7
Showing 7 of 7 rows

Other info

Follow for update