Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Toward a Holistic Approach to Continual Model Merging

About

We present a holistic framework for Continual Model Merging (CMM) that intervenes at three critical stages: pre-merging, during merging, and post-merging-to address two fundamental challenges in continual learning. In particular, conventional approaches either maintain a growing list of per-domain task vectors, leading to scalability issues or rely solely on weight-space merging when old data is inaccessible, thereby losing crucial functional information. Our method overcomes these limitations by first fine-tuning the main model within its tangent space on domain-specific data; this linearization amplifies per-task weight disentanglement, effectively mitigating across-task interference. During merging, we leverage functional information from available optimizer states beyond mere parameter averages to avoid the need to revisit old data. Finally, a post-merging correction aligns the representation discrepancy between pre- and post-merged models, reducing bias and enhancing overall performance-all while operating under constant memory constraints without accessing historical data. Extensive experiments on standard class-incremental and domain-incremental benchmarks demonstrate that our approach not only achieves competitive performance but also provides a scalable and efficient solution to the catastrophic forgetting problem.

Hoang Phan, Sungmin Cha, Tung Lam Tran, Qi Lei• 2025

Related benchmarks

TaskDatasetResultRank
Continual LearningCIFAR100 (test)
Mean Accuracy80.21
69
Class-incremental learningCIFAR100 10 Tasks
Accuracy77.72
66
Class-incremental learningImageNet-R 5-task--
64
Class-incremental learningCUB200 10 Tasks--
59
Class-incremental learningCIFAR-100 20 tasks
Accuracy75.95
58
Class-incremental learningStanford Cars CIL, T=10 (test)
Avg Accuracy69.7
33
Class-incremental learningImageNet-R (20 tasks)
Accuracy (20 Tasks)82
32
Class-incremental learningImageNet-R 10 tasks
Accuracy (10 Tasks)82.69
31
Domain-incremental learningOffice-Home--
22
Continual LearningImageNet-R (test)
Accuracy82.69
20
Showing 10 of 19 rows

Other info

Follow for update