On Continual Model Refinement in Out-of-Distribution Data Streams

About

Real-world natural language processing (NLP) models need to be continually updated to fix the prediction errors in out-of-distribution (OOD) data streams while overcoming catastrophic forgetting. However, existing continual learning (CL) problem setups cannot cover such a realistic and complex scenario. In response to this, we propose a new CL problem formulation dubbed continual model refinement (CMR). Compared to prior CL settings, CMR is more practical and introduces unique challenges (boundary-agnostic and non-stationary distribution shift, diverse mixtures of multiple OOD data clusters, error-centric streams, etc.). We extend several existing CL approaches to the CMR setting and evaluate them extensively. For benchmarking and analysis, we propose a general sampling algorithm to obtain dynamic OOD data streams with controllable non-stationarity, as well as a suite of metrics measuring various aspects of online performance. Our experiments and detailed analysis reveal the promise and challenges of the CMR problem, supporting that studying CMR in dynamic OOD streams can benefit the longevity of deployed NLP models in production.

Bill Yuchen Lin, Sida Wang, Xi Victoria Lin, Robin Jia, Lin Xiao, Xiang Ren, Wen-tau Yih• 2022

Related benchmarks

Task	Dataset	Result
Hallucination Correction	WikiBigEdit	Error Rate (ERR)2.86	24
Hallucination Correction	UniEdit	Error Rate (ERR)2.34	24
Continual Model Refinement for Extractive Question Answering	MRQA streams (val)	EFR97.43	16
Hallucination Correction	Hallucination	Error Rate (ERR)1.45e+3	10
Question Answering	zsRE	Error Rate (ERR)56	9
Model Editing	Hallucination	TRR1.45e+3	8
Model Editing	zsRE	TRR56	7
Model Editing	SCOTUS	TRR52	7

Showing 8 of 8 rows

Other info

Code

Follow for update

@wizwand_team Discord