Large Continual Instruction Assistant

About

Continual Instruction Tuning (CIT) is adopted to continually instruct Large Models to follow human intent data by data. It is observed that existing gradient update would heavily destroy the performance on previous datasets during CIT process. Instead, Exponential Moving Average (EMA), owns the ability to trace previous parameters, which can aid in decreasing forgetting. Nonetheless, its stable balance weight fails to deal with the ever-changing datasets, leading to the out-of-balance between plasticity and stability. In this paper, we propose a general continual instruction tuning framework to address the challenge. Starting from the trade-off prerequisite and EMA update, we propose the plasticity and stability ideal condition. Based on Taylor expansion in the loss function, we find the optimal balance weight can be automatically determined by the gradients and learned parameters. Therefore, we propose a stable-plasticity balanced coefficient to avoid knowledge interference. Based on the semantic similarity of the instructions, we can determine whether to retrain or expand the training parameters and allocate the most suitable parameters for the testing instances. Extensive experiments across multiple continual instruction tuning benchmarks demonstrate that our approach not only enhances anti-forgetting capabilities but also significantly improves overall continual tuning performance. Our code is available at https://github.com/JingyangQiao/CoIN.

Jingyang Qiao, Zhizhong Zhang, Xin Tan, Yanyun Qu, Shouhong Ding, Yuan Xie• 2024

Related benchmarks

Task	Dataset	Result
Object Hallucination Evaluation	POPE	--	2056
Multimodal Understanding	MMBench	Accuracy54.3	887
Multi-discipline Multimodal Understanding	MMMU	Accuracy22.9	422
Mathematical Reasoning	MathVista	Accuracy24	382
Mathematical Reasoning	MathVision	Accuracy11.6	168
Multimodal Understanding	SEEDBench2 Plus	Accuracy33.29	138
OCR-based Visual Question Answering	OCRVQA	Mean Accuracy34.31	63
Multi-image Dialogue Understanding	MMDU	Accuracy10.4	18
Knowledge Adaptation	EVOKE	CEM14.5	12
General Multimodal Understanding	MME	Normalized Score50.65	12

Showing 10 of 16 rows

Other info

Follow for update

@wizwand_team Discord