Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

HAD: Heterogeneity-Aware Distillation for Lifelong Heterogeneous Learning

About

Lifelong learning aims to preserve knowledge acquired from previous tasks while incorporating knowledge from a sequence of new tasks. However, most prior work explores only streams of homogeneous tasks (\textit{e.g.}, only classification tasks) and neglects the scenario of learning across heterogeneous tasks that possess different structures of outputs. In this work, we formalize this broader setting as lifelong heterogeneous learning (LHL). Departing from conventional lifelong learning, the task sequence of LHL spans different task types, and the learner needs to retain heterogeneous knowledge for different output space structures. To instantiate the LHL, we focus on LHL in the context of dense prediction (LHL4DP), a realistic and challenging scenario. To this end, we propose the Heterogeneity-Aware Distillation (HAD) method, an exemplar-free approach that preserves previously gained heterogeneous knowledge by self-distillation in each training phase. The proposed HAD comprises two complementary components, including a distribution-balanced heterogeneity-aware distillation loss to alleviate the global imbalance of prediction distribution and a salience-guided heterogeneity-aware distillation loss that concentrates learning on informative edge pixels extracted with the Sobel operator. Extensive experiments demonstrate that the proposed HAD method significantly outperforms existing methods in this new scenario.

Xuerui Zhang, Xuehao Wang, Zhan Zhuang, Linglan Zhao, Ziyue Li, Xinmin Zhang, Zhihuan Song, Yu Zhang• 2026

Related benchmarks

TaskDatasetResultRank
Depth EstimationNYU v2 (test)--
432
Semantic segmentationNYU v2 (test)
mIoU38.7
282
Surface Normal EstimationNYU v2 (test)
Mean Angle Distance (MAD)32.55
224
Depth EstimationCityscapes
Abs. Err.0.0186
53
Depth EstimationCityscapes
Absolute Error0.0165
34
Semantic segmentationCityscapes
mIoU76.52
26
7-class Semantic SegmentationCityscapes
mIoU69.12
18
Depth EstimationTaskonomy
Depth Error0.211
18
Multi-task Sequence PerformanceCityscapes
Delta Tb5.89
18
Depth EstimationNYU Sequence 2 v2 shuffled (test)
Absolute Error0.7024
9
Showing 10 of 24 rows

Other info

Follow for update