Toward Self-Learning End-to-End Dialog Systems

Zhang, Xiaoying; Peng, Baolin; Gao, Jianfeng; Meng, Helen

Computer Science > Computation and Language

arXiv:2201.06849v1 (cs)

[Submitted on 18 Jan 2022 (this version), latest version 28 Dec 2022 (v2)]

Title:Toward Self-Learning End-to-End Dialog Systems

Authors:Xiaoying Zhang, Baolin Peng, Jianfeng Gao, Helen Meng

View PDF

Abstract:End-to-end task-oriented dialog systems often suffer from out-of-distribution (OOD) inputs after being deployed in dynamic, changing, and open environments. In this work, we propose SL-Agent, a self-learning framework that combines supervised learning, reinforcement learning, and machine teaching for building end-to-end dialog systems in a more realistic changing environment setting. SL-Agent consists of a dialog model and a pre-trained reward model to judge the quality of a system response. SL-Agent enables dialog agents to automatically adapt to environments with user behavior changes by learning from human-bot interactions via reinforcement learning, with the incorporated pre-trained reward model. We validate SL-Agent in four different dialog domains. Experimental results show the effectiveness of SL-Agent for automatically adapting to changing environments using both automatic and human evaluations. Furthermore, experiments on a challenging domain extension setting demonstrate that SL-Agent can effectively adapt to new tasks using limited human corrections provided via machine teaching. We will release code, data, and pre-trained models for further research.

Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2201.06849 [cs.CL]
	(or arXiv:2201.06849v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2201.06849

Submission history

From: Xiaoying Zhang [view email]
[v1] Tue, 18 Jan 2022 09:56:35 UTC (1,296 KB)
[v2] Wed, 28 Dec 2022 12:59:30 UTC (5,937 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.CL

< prev | next >

new | recent | 2022-01

Change to browse by:

References & Citations

DBLP - CS Bibliography

listing | bibtex

Xiaoying Zhang
Baolin Peng
Jianfeng Gao
Helen Meng

export BibTeX citation

Computer Science > Computation and Language

Title:Toward Self-Learning End-to-End Dialog Systems

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Toward Self-Learning End-to-End Dialog Systems

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators