Continual Learning for Natural Language Generation in Task-oriented Dialog Systems
About
Natural language generation (NLG) is an essential component of task-oriented dialog systems. Despite the recent success of neural approaches for NLG, they are typically developed in an offline manner for particular domains. To better fit real-life applications where new data arrive in a stream, we study NLG in a "continual learning" setting to expand its knowledge to new domains or functionalities incrementally. The major challenge towards this goal is catastrophic forgetting, meaning that a continually trained model tends to forget the knowledge it has learned before. To address this, we propose a method called ARPER (Adaptively Regularized Prioritized Exemplar Replay) that replays prioritized historical exemplars, together with an adaptive regularization technique based on Elastic Weight Consolidation. Extensive experiments on continually learning new domains and intents are conducted on MultiWoZ-2.0 to benchmark ARPER against a wide range of techniques. Empirical results demonstrate that ARPER significantly outperforms other methods by effectively mitigating catastrophic forgetting.
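The training objective described above combines three terms: the NLG loss on the new domain's data, the same loss on replayed exemplars from earlier domains, and an EWC-style quadratic penalty that anchors parameters important to past tasks. Below is a minimal PyTorch sketch of that idea, not the authors' released code: it assumes a generic `model` whose forward call returns a scalar language-modeling loss on a batch, and the names `ewc_penalty`, `estimate_fisher`, and `lam` are illustrative.

```python
# Sketch of exemplar replay combined with an EWC-style penalty.
# Assumption: model(batch) returns a scalar LM loss for that batch.
import torch


def estimate_fisher(model, exemplar_loader):
    """Diagonal Fisher estimate from replayed exemplars (squared gradients)."""
    fisher = {n: torch.zeros_like(p) for n, p in model.named_parameters()}
    for batch in exemplar_loader:
        model.zero_grad()
        model(batch).backward()
        for n, p in model.named_parameters():
            if p.grad is not None:
                fisher[n] += p.grad.detach() ** 2
    for n in fisher:
        fisher[n] /= max(len(exemplar_loader), 1)
    return fisher


def ewc_penalty(model, fisher, old_params):
    """Quadratic penalty anchoring parameters important to earlier domains."""
    penalty = torch.zeros((), device=next(model.parameters()).device)
    for name, param in model.named_parameters():
        if name in fisher:
            penalty = penalty + (fisher[name] * (param - old_params[name]) ** 2).sum()
    return penalty


def train_step(model, optimizer, new_batch, exemplar_batch, fisher, old_params, lam):
    """One continual-learning step: new-domain loss + exemplar replay + EWC term."""
    optimizer.zero_grad()
    loss = model(new_batch)                                       # loss on the incoming domain
    loss = loss + model(exemplar_batch)                           # replay prioritized exemplars
    loss = loss + lam * ewc_penalty(model, fisher, old_params)    # adaptive regularization
    loss.backward()
    optimizer.step()
    return loss.item()
```

In ARPER the exemplars are chosen by a priority score rather than sampled uniformly, and the penalty weight (`lam` here) is adapted across tasks; this sketch leaves both as fixed inputs for brevity.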
Related benchmarks
| Task | Dataset | Metric | Result | Rank |
|---|---|---|---|---|
| Natural Language Processing | decaNLP Tasks seen (test) | AN Score | 69.78 | 14 |
| Question Answering | QA Tasks seen (test) | AN Score | 52.69 | 14 |
| Natural Language Processing | decaNLP Tasks (unseen) | AN' Score | 30.22 | 14 |
| Question Answering | QA Tasks (unseen) | AN' Score | 36.79 | 14 |
| Dialogue State Tracking | ToDs benchmark GPT-2 backbone (test) | JGA | 39.21 | 11 |
| End-to-End Dialogue Modeling | ToDs (test) | Intent Accuracy | 77.6 | 11 |
| Intent Classification | ToDs benchmark GPT-2 backbone (test) | Accuracy | 0.7963 | 11 |
| Natural Language Generation | ToDs benchmark GPT-2 backbone (test) | EER | 6.08 | 11 |