Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Large Continual Instruction Assistant

About

Continual Instruction Tuning (CIT) is adopted to continually instruct Large Models to follow human intent data by data. It is observed that existing gradient update would heavily destroy the performance on previous datasets during CIT process. Instead, Exponential Moving Average (EMA), owns the ability to trace previous parameters, which can aid in decreasing forgetting. Nonetheless, its stable balance weight fails to deal with the ever-changing datasets, leading to the out-of-balance between plasticity and stability. In this paper, we propose a general continual instruction tuning framework to address the challenge. Starting from the trade-off prerequisite and EMA update, we propose the plasticity and stability ideal condition. Based on Taylor expansion in the loss function, we find the optimal balance weight can be automatically determined by the gradients and learned parameters. Therefore, we propose a stable-plasticity balanced coefficient to avoid knowledge interference. Based on the semantic similarity of the instructions, we can determine whether to retrain or expand the training parameters and allocate the most suitable parameters for the testing instances. Extensive experiments across multiple continual instruction tuning benchmarks demonstrate that our approach not only enhances anti-forgetting capabilities but also significantly improves overall continual tuning performance. Our code is available at https://github.com/JingyangQiao/CoIN.

Jingyang Qiao, Zhizhong Zhang, Xin Tan, Yanyun Qu, Shouhong Ding, Yuan Xie• 2024

Related benchmarks

TaskDatasetResultRank
Object Hallucination EvaluationPOPE--
2019
Multimodal UnderstandingMMBench
Accuracy54.3
847
Mathematical ReasoningMathVista
Accuracy24
382
Multi-discipline Multimodal UnderstandingMMMU
Accuracy22.9
363
Mathematical ReasoningMathVision
Accuracy11.6
168
Multimodal UnderstandingSEEDBench2 Plus
Accuracy33.29
138
OCR-based Visual Question AnsweringOCRVQA
Mean Accuracy34.31
50
Knowledge AdaptationEVOKE
CEM14.5
12
General Multimodal UnderstandingMME
Normalized Score50.65
12
Knowledge RetentionKnowledge Retention Suite MME, MMBench, SEEDBench2 Plus, OCRVQA, ScienceQA, MMMU, MIA-Bench, MMDU, MathVista, MathVision, POPE, HallusionBench
COM52.47
12
Showing 10 of 16 rows

Other info

Follow for update