Hyperparameter-free Continuous Learning for Domain Classification in Natural Language Understanding

Hua, Ting; Shen, Yilin; Zhao, Changsheng; Hsu, Yen-Chang; Jin, Hongxia

doi:10.18653/v1/2021.naacl-main.212

Computer Science > Computation and Language

arXiv:2201.01420 (cs)

[Submitted on 5 Jan 2022]

Title:Hyperparameter-free Continuous Learning for Domain Classification in Natural Language Understanding

Authors:Ting Hua, Yilin Shen, Changsheng Zhao, Yen-Chang Hsu, Hongxia Jin

View PDF

Abstract:Domain classification is the fundamental task in natural language understanding (NLU), which often requires fast accommodation to new emerging domains. This constraint makes it impossible to retrain all previous domains, even if they are accessible to the new model. Most existing continual learning approaches suffer from low accuracy and performance fluctuation, especially when the distributions of old and new data are significantly different. In fact, the key real-world problem is not the absence of old data, but the inefficiency to retrain the model with the whole old dataset. Is it potential to utilize some old data to yield high accuracy and maintain stable performance, while at the same time, without introducing extra hyperparameters? In this paper, we proposed a hyperparameter-free continual learning model for text data that can stably produce high performance under various environments. Specifically, we utilize Fisher information to select exemplars that can "record" key information of the original model. Also, a novel scheme called dynamical weight consolidation is proposed to enable hyperparameter-free learning during the retrain process. Extensive experiments demonstrate that baselines suffer from fluctuated performance and therefore useless in practice. On the contrary, our proposed model CCFI significantly and consistently outperforms the best state-of-the-art method by up to 20% in average accuracy, and each component of CCFI contributes effectively to overall performance.

Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2201.01420 [cs.CL]
	(or arXiv:2201.01420v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2201.01420
Journal reference:	Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies,pages 2669--2678
Related DOI:	https://doi.org/10.18653/v1/2021.naacl-main.212

Submission history

From: Ting Hua [view email]
[v1] Wed, 5 Jan 2022 02:46:16 UTC (4,071 KB)

Computer Science > Computation and Language

Title:Hyperparameter-free Continuous Learning for Domain Classification in Natural Language Understanding

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Hyperparameter-free Continuous Learning for Domain Classification in Natural Language Understanding

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators